Vol. 27, No. 1 February, 1943 


Journal of Applied Psychology 


EDITED BY: DONALD G. PATERSON, UNIVERSITY OF MINNESOTA 


Consulting Editors 


Paul S. Achilles, Psychological Corporation; Walter V. Bingham, A.G.O., War Department;
Harold E. Burtt, Ohio State University; Arthur I. Gates, T. C. Columbia University;
John G. Jenkins, University of Maryland; Irving Lorge, T. C. Columbia University;
Quinn McNemar, Stanford University; Willard C. Olson, University of Michigan;
James P. Porter, Ohio University; Edward K. Strong, Jr., Stanford University;
Morris S. Viteles, University of Pennsylvania; Joseph Zubin, N. Y. Psychiatric Hospital.





Table of Contents


The Eighth Nation-Wide Social Experimental Survey: H. C. Link
An Experimental Study of Transfer of Training in Motor Learning: P. Woodward
The Use of the Minnesota Rate of Manipulation and the O’Connor Finger Dexterity Tests in the Selection of Package Wrappers: E. E. Ghiselli
Selection of Sewing Machine Operators: M. L. Blum
The Constancy of the Intelligence Quotient As Indicated By Retests of 130 Children: M. E. Allan and F. M. Young
An Outline for the Systematic and Comprehensive Study of a Personality: C. Leuba
The JNB Psychograph: J. N. Buck
Psychology in Kansas High Schools: W. H. Mikesell
G. Stanley Hall, Psychologist and Educator: R. Zevics
The Prediction of Differential Achievement in a Technological College: W. McGehee
Interests of Business College Students: An Indirect Validation of the Strong [remainder of entry illegible]
Scholastic Achievement and Measured Interests: R. C. Crosby
Relation of Order of Appearance to Mark in Oral Tests: N. J. Powell
A Binocular Reading Test: G. Spache
News and Notes
Book Reviews
New Books, Monographs, and Pamphlets





Published Bi-monthly by The American Psychological Association, Inc. 
With the Cooperation of The American Association of Applied Psychology 
Prince and Lemon Sts., Lancaster, Pa., and Northwestern University, Evanston, Illinois


Entered as second-class matter, May 7, 1935, at the post office at Lancaster, Pa., under the Act of March 3, 1879. 







































The Eighth Nation-Wide Social Experimental Survey 


Henry C. Link 
Psychological Corporation 


In this survey 5000 personal interviews were made with men and 
women in 95 cities and towns selected to constitute a proper sample of 
the urban population. All interviews were made between November 11 
and 19 by 345 interviewers under the immediate direction of 98 associated 
psychologists acting as local supervisors. 

Two questionnaires. This study was made with two questionnaires, 
each of which was used with half the sample, or 2500 interviews. The 
distribution of these two samples was alike in all respects. The exact 
wording of the questions is given above the corresponding tables of re- 
sults. The results from both questionnaires will be presented in five 
logical groupings, under the following headings: 


1. How Leading Questions Determine Answers 
2. The Public Becomes More Optimistic 

3. The Growing Influence of Religion 

4. Public Consciousness of War Sacrifices 

5. Free Enterprise or Government Control? 


Purpose. This particular series of social experimental surveys was
begun in February, 1937. The purpose was: (a) To develop and perfect
techniques in the measurement and evaluation of public opinion;
(b) To obtain information about certain social trends of current
importance. These studies have usually been made with from 4000 to
5000 interviews. The previous one, “A Study of Public Opinion and
Morale,” based on 4000 interviews, was published in this Journal in
December, 1941.¹ The first two were reported in this Journal in
October, 1937.²

1 Link, H. C. A Study of Public Opinion and Morale. J. appl. Psychol., 1941,
25, 636-645.

2 The Psychological Corporation. A Study of Public Relations and Social
Attitudes. J. appl. Psychol., 1937, 21, 589-602.




The cooperation of clients. In preparing these studies we have con- 
sulted our clients and friends and obtained from them valuable sugges- 
tions. Specific questions suggested by clients have therefore been 
included in these studies on the condition that The Psychological Corpora- 
tion was free to publish the results as seemed desirable. 

Private questions. A number of clients saw in these studies a con- 
venient vehicle for one or more questions for their exclusive use. There- 
fore we accepted a number of such commissions and the payments re- 
ceived helped to defray the cost of these studies and to make possible a 
more complete treatment of their results. In the present survey six 
clients paid for exclusive questions. This not only made it possible to 
pay full rates to supervisors and interviewers, but also to do a machine 
tabulation of results. 

Distribution of interviews. All interviews were made in the home, 
half with men and half with women, usually the husband or the wife. 
However, only one interview was made in any one home. The interviews
were distributed by cities and towns so as to constitute a good sample
of the urban population, which makes up about 56% of the total
population of the United States.

Distribution by income or socio-economic groups. In each locality 
interviews were distributed by income and socio-economic classifications 
as follows: 


A group, the highest 10% by income and other criteria 
B group, the upper middle 30% by income and other criteria 
C group, the lower middle 40% by income and other criteria 
D group, the lowest 20% by income and other criteria 
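
The group shares listed above translate directly into interview quotas. The following is a minimal sketch of that arithmetic applied to the full sample of 5000; the function name `allocate_quota` and the use of Python are illustrative assumptions and are not part of the survey procedure described here.

```python
# Illustrative sketch: convert the stated group shares into interview quotas.
# The shares (10/30/40/20 per cent) and the total of 5000 come from the text;
# the function name and structure are assumptions made for this example.

def allocate_quota(total, shares_pct):
    """Return the number of interviews per group for integer percentage shares."""
    if sum(shares_pct.values()) != 100:
        raise ValueError("shares must sum to 100 per cent")
    # Integer arithmetic avoids floating-point rounding surprises.
    return {group: total * pct // 100 for group, pct in shares_pct.items()}


SHARES = {"A": 10, "B": 30, "C": 40, "D": 20}
quotas = allocate_quota(5000, SHARES)
print(quotas)
```

The same function could be applied locality by locality, since the text states that the classification was used in each locality; the per-locality totals are not given in the source.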


The D group, for example, represented people in the foreign language 
sections, colored sections, poorest tenement areas, poorest living quarters 
generally. The A group represented approximately 10% living in the 
largest houses, the best parts of the city, with two cars or more, two 
bathrooms or more, etc. The C income group represented the large
industrial population consisting of skilled and semi-skilled factory work- 
ers, mechanics, people engaged in transportation and special services, 
the smaller tradesmen, etc. The B group consisted largely of white 
collar and professional or semi-professional workers. 

Income today cannot be considered the only criterion of these groups 
because, at the present time, the average earnings of the C group may be 
as large or larger than the average earnings of the B group. The ex- 
tent to which these four groups really represent different characteristics 
in our socio-economic structure is well reflected in the tables by income 
groups given hereafter. 




Distribution of 5000 Interviews

By Geographic Areas
                              No.    Per Cent
[label illegible]            1850       37
[label illegible]            1575       31
[label illegible]            1030       21
[label illegible]             545       11
Total                        5000      100

By Income Groups
                              No.    Per Cent
A group                       500       10
B group                      1500       30
C group                      2000       40
D group                      1000       20
Total                        5000      100

By Size of City
                              No.    Per Cent
[label illegible]            1502       30
[label illegible]            1102       22
[label illegible]             474        9
[label illegible]            1127       23
[label illegible]             795       16
Total                        5000      100

By Sex
                              No.    Per Cent
[label illegible]            2454       49
[label illegible]            2546       51
Total                        5000      100


How Leading Questions Determine Answers 


In a study made in 1934,³ we found that the addition of one word to
a question made a difference of 12 per cent in the answers. Such a
difference is trifling compared with differences of 22 per cent and 45 per
cent such as are shown below.

It is extremely difficult, if not impossible, to frame an opinion
question which is not leading or which does not carry some element of bias
or suggestion.⁴ Even if it were possible, the setting in which it was
placed, or the bias of the interviewer, would still influence the result.

The critical problem is not whether a question is leading but whether 
it is fair. In other words, it must not be misleading. In the present 
study we framed two questions purposely leading or misleading. One 
of these was asked in one half of the interviews, the second in the other 
half. The first was: 

3 Link, H. C. How Many Interviews Are Necessary for Results of a Certain
Accuracy? J. appl. Psychol., 1937, 21, 1-17.


4 Link, H. C., and Freiberg, A. D. The Problem of Validity vs. Reliability in 
Public Opinion Polls. Publ. Opin. Quart., 1942, 5, 87-98. 








“As you know, this war is costing a lot of money. Do you think that advertising in
war time is a necessary or unnecessary expense?”


42% said, necessary
38  said, unnecessary
20  said, both or don’t know

The introductory sentence, about the high cost of the war, gives a
definite bias to this question against an answer in favor of advertising.
However, even without this introductory sentence, if this question had 
been asked in a questionnaire heavily loaded with questions on the ex- 
pense of the war, the bias might still have been the same. The other 
question was: 


“Do you think that advertising in war time is necessary or unnecessary?”


64% said, necessary
26  said, unnecessary
10  said, don’t know

Here the per cent of answers in favor of advertising is 22 per cent
higher. Nevertheless, the phrase “in war time” may also create a bias
against advertising because of the frequently expressed belief that in war
time everything must be different.
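
The comparison above is a difference between two percentages obtained from separate half-samples. A minimal sketch of the arithmetic follows. The standard-error formula assumes two independent simple random samples of 2500 each; that assumption is added here for illustration only, since the survey used a quota sample and its true sampling error may differ.

```python
import math

# Percentages answering "necessary" under each wording (from the text).
p_with_intro = 0.42     # question preceded by the cost-of-war sentence
p_without_intro = 0.64  # question asked without the introductory sentence
n = 2500                # interviews per questionnaire (from the text)

# Difference between the two wordings, as a proportion.
diff = p_without_intro - p_with_intro

# Standard error of the difference between two independent proportions.
# ASSUMPTION: simple random sampling; the survey itself used quota sampling.
se = math.sqrt(
    p_with_intro * (1 - p_with_intro) / n
    + p_without_intro * (1 - p_without_intro) / n
)

print(f"difference: {diff * 100:.0f} percentage points")
print(f"approximate standard error: {se * 100:.2f} percentage points")
```

Under these assumptions the difference is many times larger than its standard error, which is consistent with attributing it to the wording rather than to chance.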
In two other studies we have made this year, one in October and one
in April, people were asked the following question (question and results
quoted with permission of the Association of National Advertisers):





“Do you think that during the war companies should give up advertising entirely, or
do you believe that they should keep on with at least some of their advertising?”


In        In
April     October   Answers
10%        8%       said, give up advertising
83        87        said, keep on advertising
 7         5        said, don’t know


Since all three questions given above are leading, the real problem is:
“Which question is the least misleading?” In our opinion, it is the
third, and for the following reasons:

In the first place, it does not compel people to choose between two
implied extremes, complete discontinuance of advertising or advertising
as usual. Here the alternate is to “keep on with at least some
advertising.” Secondly, this question was included with other questions
dealing with the role of advertising in war time. Also, sample
advertisements illustrating war time uses were shown in connection with certain
questions. It is reasonable to suppose that this procedure would produce
a better informed, more intelligent response than would be forthcoming
to the abrupt question: Do you think that advertising in war time is
necessary or unnecessary?







This procedure for obtaining an informed or considered response can
be described as a psychological warming-up process. It has its mechanical
counterpart in warming up an airplane motor before flight. It is
just as reasonable to warm up the individual’s mind as it is to warm up
a motor before asking it to take the plane off the ground.


The Public Becomes More Optimistic 


The study made in October 1941 showed a high degree of pessimism 
on the part of the public in respect to their economic future. The pres- 
ent study reveals an important change in people’s opinions about the 
future, amounting at one point to a complete reversal. The most radical 
change was in answer to the question: 


“When this war is over do you think that the people of this country will be better or
worse off than they are now?”


In        In
1941      1942      Answers
13%       40%       thought they would be better off
69        37        thought they would be worse off
18        23        had no opinion


A further series of questions showed smaller changes, but all in the 
direction of greater optimism. The questions were: 


“How about jobs; will there be more, fewer, about the same?”

1941      1942      Answers
 8%       20%       thought there would be more
74        50        thought there would be fewer
11        18        thought about the same
 7        12        had no opinion

“Will taxes be higher, about the same, or lower?”

1941      1942      Answers
67%       53%       thought taxes would be higher
17        25        thought taxes would be about the same
 9        11        thought taxes would be lower
 7        11        had no opinion

“Will wages be higher, about the same, or lower?”

1941      1942      Answers
10%        7%       thought wages would be higher
20        24        thought wages would be about the same
60        59        thought wages would be lower
10        10        had no opinion

“Will our government be less democratic, more democratic, or the same as now?”

1941      1942      Answers
26%       17%       thought our government would be less democratic
19        32        thought it would be more democratic
33        31        thought it would be the same as now
22        20        had no opinion










The above questions were asked in one half of the interviews. The
alternate questionnaire used with the other half of our interviews
included the following question:

“There has been a lot of talk about inflation and its effect on the value of money.
If you have any money in life insurance or in savings of any kind, do you think
that money will be worth as much or buy as much 2 years from now as it does
today?”

1941      1942      Answers
26%       36%       answered Yes
59        42        answered No
15        22        had no opinion


The 42% who answered “No” were then asked about how much they
thought their money would be worth. Their answers were:

11% thought [fraction illegible] as much
15  thought [fraction illegible] as much
 3  thought less than [fraction illegible] as much
 1  thought more than [fraction illegible] as much
12  had no opinion

Various reasons might be given for this change to a more optimistic
outlook, such as the better war news early in November, the election
results, higher employment, ceiling prices, continued war prosperity, etc.
The situation is too complicated to permit us to venture even a guess.
However, the fact that the public is more optimistic, whether rightly or
wrongly, is important. It may even be regarded as an element in the
morale of the people, and if it represents a less fatalistic attitude on their
part toward their own future and a greater confidence in their own powers
to shape that future, it is an encouraging sign.











The Growing Influence of Religion 


What people think about the growing influence of religion in the 
United States may also be considered as evidence of a growing optimism. 
The question asked in this case was: 


“Do you think religion is losing or gaining influence in the United States?”


This question was also asked in several earlier studies so that we have 
comparable results and trends for a six year period as follows: 


                     Feb.    [?]     Oct.    Nov.
                     1937    1939    1940    1942
                      %       %       %       %
Losing                44      32      42      20
Gaining               34      45      35      58
[label illegible]     13      11      16      10
Don’t know             9      12       7      12


From the above it will be seen that the proportion of people who
believe that the influence of religion is growing in the United States is much
higher than it has been at any other time in this six year period, whereas
the proportion who believe that it is losing influence is lower. If the
influence of religion is synonymous with the presence of goodness or a
better life, then this result may also be taken as an indication of a greater
optimism on the part of the public at large.


Public Consciousness of War Sacrifices 


The public has been told again and again during the past year that 
great sacrifices will be required of them to help win the war. To what ex- 
tent has the public actually been called on to make these sacrifices? To 
what extent are people conscious of the war so far as it affects them per- 
sonally? This study does not presume to answer these questions fully. 
However, it does throw some light on the subject. One of our questions
was:

“Is your family more prosperous today than two years ago, less prosperous, or the
same?”

In Oct.   In Nov.
’41       ’42       Answers
38%       29%       said, more prosperous
47        47        said, about the same
15        21        said, less prosperous
           3        did not know
In short, after almost a year of war, 76% of the people consider themselves
as prosperous or even more prosperous than they were two years
ago. The significance of this situation becomes even greater when we
analyze the results by the four socio-economic or income groups, namely:

                              Upper    Lower
Answers              High     Middle   Middle   Low
                      %        %        %        %
More prosperous       23       28       31       29
As prosperous         45       47       46       50
Less prosperous       31       22       19       18
Did not know           1        3        4        3


The decrease in prosperity, or the consciousness of such a decrease, 
it will be seen, is greatest in the top income group, and least in the lowest 
group. It ranges from 31% to 18% who think that they are less pros- 
perous, and from 68% to 79% who think they are as prosperous or more 
prosperous than two years ago. 
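
The ranges quoted above come from adding the "more prosperous" and "as prosperous" percentages within each income group. A short sketch of that arithmetic, using the figures from the table, is given here; the variable names are illustrative, and the label of the last column is assumed because it is only partly legible in the source.

```python
# Percentages by income group, taken from the table in the text:
# (more prosperous, as prosperous, less prosperous, did not know)
# The "did not know" label is an assumption; the source label is garbled.
results = {
    "High":         (23, 45, 31, 1),
    "Upper Middle": (28, 47, 22, 3),
    "Lower Middle": (31, 46, 19, 4),
    "Low":          (29, 50, 18, 3),
}

# Share who consider themselves as prosperous or more prosperous than before.
as_or_more = {group: row[0] + row[1] for group, row in results.items()}

# Share who consider themselves less prosperous.
less = {group: row[2] for group, row in results.items()}

print(as_or_more)
print(min(less.values()), max(less.values()))
```

Each row of the table sums to 100, so the "as or more prosperous" share is simply what remains after the "less prosperous" and undecided answers are set aside.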

However, even these differences by income groups are probably secondary
compared with the fact that the great majority in all four groups
consider themselves as prosperous or more prosperous than before. To
be sure, we have no objective definition of prosperity or what people
mean by prosperity. However, when we consider the fact that 1942 incomes
have increased $49,000,000,000 since 1939, of which only a relatively
small part has been or will be taken away by taxes, more prosperity
probably means a larger net income or a higher purchasing power.

The effect of higher income taxes and lower exemptions has not yet
fully penetrated people’s consciousness. This may be seen from the
results to the following question:


“Have you or your family figured out about how much U. S. Income Tax you will
have to pay in 1943 for this year, 1942?”


                            Upper    Lower
Answers   Total    High     Middle   Middle   Low
           %        %        %        %        %
Yes        30       50       39       26       16
No         70       50       61       74       84


Not only the total results, given at the left, but the results by the
four income groups are significant. As regards men and women, 65%
of the men interviewed said they had not as yet figured out their income
tax, whereas 74% of the women interviewed said that they or their family
had not yet made this estimate. All people, regardless of their answer
to this question, were then asked the following question: “Is it
twice as much as last year, three times as much, or about how many times
as much?” The answers to this question seem to indicate that many
people in the lower income groups do not realize how sharp the increase
in their income tax for 1942 will be.

Another aspect of sacrifice is the extent to which people have been
unable to buy, with the money at their disposal, the things which they
would have liked to buy. To obtain some measure of this we asked a
question in two different forms, question A in one half of the interviews
and question B in the other half. Question A was worded to call for a
free response other than sugar and tires, whereas question B, while
suggesting a half dozen common items, emphasized the “favorite brand.”

The questions were:


A. “Have you (or your wife) been unable to buy something you wanted, I mean because
there was a shortage due to the war—anything outside of sugar and tires I
mean?”
B. “Recently have you been unable to buy your favorite brand of the following
products: coffee, soup, tea, canned fruits, canned vegetables, gasoline, or anything
else?”


In answer to the first form of the question, 40% said no; in answer
to the second form, 27% said no. The results were very uniform for all
income groups, although the percentage of those who said they had no
difficulty in buying what they wanted in both cases was higher for the
lower income groups than for the higher income groups. The articles
named and the frequency are of considerable interest as may be seen
below:


Q.A.                                                    Total %
[label illegible]                                          36
Canned products                                             8
[label illegible]                                           5
[label illegible]                                           5
[label illegible]                                           5
[label illegible]                                           5
[label illegible]                                           5
[label illegible]                                           3
Candy and chocolate                                         3
Rubber articles                                             3
[label illegible]                                           2
[label illegible]                                           2
Fuel oil, electric appliances, shortening, spices,
  Soden, Cocoa, Ovaltine, rice, Coca-Cola, etc.            14
[label illegible]                                          40

Q.B.                                                    Total %
[label illegible]                                          60
[label illegible]                                          14
[label illegible]                                          19
[label illegible]                                          14
Canned vegetables                                          11
Other canned goods                                          1
[label illegible]                                           8
[label illegible]                                           2
[label illegible]                                           3
Food products                                               5
[label illegible]                                           1
Metal and metal products                                    1
Tires, lard and soap, etc.                                  5
[label illegible]                                          27


The items people are most conscious of having difficulty with are the 
items in most frequent use, or items to the rationing of which they have 
not become accustomed. Tires and automobiles are hardly mentioned 
at all. A conspicuous result, however, is that people had had little or no 
difficulty, at the time of this study, in getting most of the things they 
wanted or even the specific brands of the items they wanted. 


Free Enterprise or Government Control? 


Under the title ‘‘Can We Save Free Enterprise’’ in The American 
Magazine of November 1941, Charles E. Wilson, then President of the 
General Electric Company, made an important plea for post-war plan- 
ning by private industry. He also made a practical proposal which, in 
the light of subsequent events, seems more important and more timely 
today than ever. To the extent that private industry can effectively 
plan its conversion or reconstruction, to that extent government control 
will be less necessary. 

Are people conscious of the relationship between free enterprise and 
government control? Do they appreciate or do they fear the trends 
towards State Socialism and an all powerful central government? This 
important issue is complicated today by the necessities of war. The 
following question puts the issue of government control vs. free manage- 
ment in an extreme form, namely: 


“Do you think that business companies will do a better job if they are allowed to
keep on under their own management, or if the government takes them over and
runs them completely?”


67% said, if keep on under own management
14  said, if government runs them completely
19  said, don’t know










This result looks quite different when analyzed by income groups. 
The per cent in favor of ‘‘own management,’’ by income groups, ranges 
from 90 to 46. The per cent in favor of ‘‘complete government control’’ 
or ‘‘doubtful’’ ranges from 10 to 54. The details by four major income 
groups are: 


                                                 Upper    Lower
Answers                                 High     Middle   Middle   Low
                                         %        %        %        %
If keep on under own management          90       76       66       46
If government runs them completely        5        9       14       24
Don’t know                                5       15       20       30
No. of interviews                       250      750     1000      500


The extent of reliance on a centralized government is most fully
revealed in connection with post-war considerations. This may be seen in
the answers to the following question which was asked also in the
October 1941 study:


“Who do you think can do the best job in straightening things out after the war:
The government in Washington, business leaders, labor union leaders, or others?”


In Oct.   In Nov.
1941      1942      Answers
47%       56%       said, the government in Washington
26        25        said, business leaders
 5         8        said, labor union leaders
 7         6        said, all three together
17        17        mentioned others or had no opinion
2000      2500      No. of interviews


When we analyze these opinions by the four income groups, we have the 
following revealing picture: 


                              High       Upper Middle   Lower Middle   Low
Answers                     ’41  ’42     ’41  ’42       ’41  ’42       ’41  ’42
                             %    %       %    %         %    %         %    %
Government in Washington     33   44      46   53        48   57        54   64
Business leaders             45   40      31   31        24   23        13   10
Labor union leaders           2    6       2    7         6    8         8    8


As compared with last October, there is a substantial increase in the 
proportion in every income group who think that the government in 
Washington can do the best job in straightening things out after the 
war. No matter how interpreted, this is not an overwhelming vote of 
confidence in business leadership or in the power of our system of free 
enterprise to bring us out of the post-war emergency. It is an expres- 
sion of reliance on a central government to furnish the necessary leader- 
ship and planning. In view of the fact that business leaders are not 
even convinced, in fact, are quarreling among themselves as to whether 
or not they should do any post-war planning now, this result is not al- 
together surprising. 

























In this study we tried to obtain a measure of how people responded to 
the term ‘‘free enterprise’’ in comparison with some alternate names. 
The question asked was as follows: 


“As you know, this country has the highest standard of living and is the richest
country in the world. Which one of the following do you think would be the
best name for this American system: Capitalism, Free Enterprise, Competitive
Business, Independent Industry?”


These four alternatives were chosen after a number of preliminary
tests with these and other names. The results were:

 9% said, capitalism
23  said, free enterprise
20  said, competitive business
18  said, independent industry
30  didn’t know or said none


Free enterprise, it is obvious, does not arouse an overwhelming response. 
The significant fact is that the largest percentage of people do not re- 
spond to any of these names. 

In the October 1941 study, “The Perils of Peace,” we made the
following statement under the heading, “The Futility of Generalities”:
“The public cannot easily be stirred or influenced by speeches and
pamphlets in defense of our present economic system as such. Broad
generalities and statistics about the merits of the American Way, Free
Enterprise, the Merits of Advertising, the Partnership of Capital and
Labor, leave the people cold. Threats of the imminence of State Socialism
and Dictatorship fail to convince them. The great public neither
fears nor understands State Socialism, nor does it crave or understand
our system of free enterprise.

“The long-pull job of education in fundamentals must, of course, be
undertaken. However, for effective business leadership now, the stereotypes
and generalities should be systematically eliminated from the
vocabularies of business executives. To be sure, the fundamentals of
life and democracy which such a generality as ‘free enterprise’ stands
for are as permanent as the laws of nature. However, effective leadership
by business executives requires the development of a new and more
realistic vocabulary for expressing these permanent values.”

The practical problem here is not only one of semantics but one of 
dynamics. It is the problem of giving to some name, by deeds as well 
as by words, a living content such as will become a conscious part of 
American thinking. Free planning for the post-war period may be an 
important step in this direction. 


Received January 28, 1943. 














An Experimental Study of Transfer of Training 
in Motor Learning * 


Patricia Woodward 
Children’s Aid Society and Conference of Catholic Charities, Pittsburgh, Pa. 


The aim of this study has been to determine whether a significant 
degree of transfer of training in motor learning occurs when the train- 
ing consists of learning a task which is identical with the ‘‘test’’ task 
in the pattern of motions involved but requires the manipulation of en- 
tirely different materials. 

The problem developed from an interest in the possibility of pro- 
viding general training in a vocational high school which would prepare 
the girls to perform routine work in various industries.¹ That the
present training in trade schools is not entirely adequate was shown by 
Koepke (7) who, in a survey of 88 firms representing 33 different in- 
dustries in Minnesota, found that 72% of the workers fell into the semi- 
skilled group, and that only two of the numerous operations studied 
in these different industries were operations for which training was given 
in vocational schools. Although his findings cannot be considered as 
giving an accurate picture of the present situation, they still point out 
an existing problem. It seems likely that it is even greater for girls 
than previously because the training offered in trade schools is still 
limited for the most part to the traditional sewing, cooking, beauty shop, 
and occasionally office machine training, and yet girls are beginning to 
enter more skilled industrial work in increasing numbers. 

Observing that the basic movements employed on the different jobs 
tended to be very similar in character even when the jobs were selected 
from very diverse industries, Koepke suggested that the schools should 

* This is the essential portion of a dissertation submitted in partial fulfillment of 
the requirements for the degree of Doctor of Philosophy in the Department of Psy- 
chology in the Graduate School of the University of Pennsylvania, June, 1942. The 
writer is indebted to Dr. M. S. Viteles and to Dr. Malcolm G. Preston, under whose
direction this study was carried out. 

1 The need for such a training program was described to the writer by Miss Reth 
Sill, Principal of the Fleisher Vocational School for Girls, Philadelphia, who found 
that a certain group of girls, generally below average in mental ability, could not
profit from the usual trade training offered in the school, and most often went into 
routine industrial work, sometimes failing to make a satisfactory adjustment to it. 


There was no training offered by the school which gave promise of helping them 
make a more satisfactory adjustment to the industrial situation. 
























give general training in dexterity and bimanuability, that is, the ability
to use both hands at one time but at different tasks. 

Shaw (12) reports a training program in a plant in which specific 
training was given to operators who had to learn changes in a work proc- 
ess to which they had already become accustomed. General training was 
given to new workers entering the factory and consisted of preliminary 
training in the general use of tools, ‘‘the simultaneous use of both hands, 
and the arrangement of material’’ (126), followed by specialized train- 
ing in one of several kinds of work. Although she presents no data, 
she reports that upon completing the training ‘‘the girls have a general 
knowledge of the principles of motion economy and more particular 
knowledge of one of the classes of women’s work in the factory as well 
as being familiar with factory routine’’ (126). 

A training program such as Koepke recommends and Shaw describes 
assumes that transfer occurs between the general training and the 
specific industrial task. This fact has not been demonstrated as yet. 
The present study has been planned as a critical investigation of one 
aspect of the problem. If transfer occurs from general training to a 
specific task, one would also expect it to occur from one specific task to 
another if their basic motions are the same. 

The theoretical explanation for such transfer is to be found in Thorn- 
dike’s theory of identical elements, i.e., if the general training is effec- 
tive in producing improvement in a variety of specific tasks, it is because 
the elements of the specific tasks are practiced in the course of the 
general training. 

There is considerable disagreement about the meaning of the term 
‘‘identical elements,’’ whether they refer to the simplest elements of
which a whole is composed, or to the problem as a whole, or to the gen- 
eral attitude or ‘‘set’’ toward the problem. James (6) does not consider 
that the identity of the end product, of the function performed, or the 
purpose served, insures actual identity of the elements, and suggests that 
it would be better ‘‘to describe abilities in terms of their constituents 
within the ability.’’ Woodworth (13) also favors using the term
constituent or component instead of element. ‘‘What the theory of identical
elements demands is that transfer should be of concrete performances, 
whether simple or complex makes no difference to the theory’’ (177). 
Bayles (2) says it is necessary for the subject to comprehend the ele- 
ments in common if transfer is to occur. 

Very few of the previous studies in the field of motor learning have 
been planned as critical tests of the theory of identical elements. Crafts 
(4) used card-sorting, and studied the relationship between amount 
of transfer and the similarity of the tasks. He found the amount of 













transfer to be roughly proportional to the number of common elements, 
and held that the results supported Thorndike’s view ‘‘that the amount 
of transfer is directly related to the number of elements common to train- 
ing and test situations’’ (157). Freeman (5) attempted to test the 
theory with psycho-physiological elements rather than logical ones, but 
found only negative results. His use of highly practiced subjects seems 
to be a real limitation of his study, however, as results would be expected 
to show up much more clearly in the early part of a learning curve. 

There have also been very few studies utilizing materials taken di- 
rectly from an industrial situation or resembling one closely. Several 
of the British studies, particularly of Langdon (8) and Cox (3), have 
used such materials, but their primary interest has been to study the 
possible occurrence of a general factor or of group factors influencing the 
performance on a variety of motor tests. Langdon and Yates (9) used 
a training task similar to the assembling of bicycle chains, and tested the 
subjects on 15 different tests which were ‘‘chosen because of their sup- 
posed resemblance to different features’’ of the training task. No ob- 
jective criterion of similarity between the two tasks was used. No sig- 
nificant degree of transfer occurred although they found a common factor 
which ‘‘is mainly responsible for their (the tests) intercorrelation’’ (32). 

Cox used the various assembling processes involved in the manufac- 
ture of an electric light socket as his test and training tasks, and found 
no significant degree of transfer when the training task was repeated or 
‘‘practiced.’’ However, when the subjects were given training in the
general principles of handling materials, and fewer repetitions of the 
training task, he found a significant degree of transfer. The subjects in 
both studies were either grade school boys or girls, or college students, 
none of whom could be considered as similar to the worker entering in- 
dustry. 

The chief characteristics of the present study are that an objective 
method of motion analysis has been used to establish the close similarity 
between the test and training tasks, trade school girls have served as 
subjects, and the training has consisted only of repetition of the task, 
with no teaching of general principles of work, and no insight into the 
similarity of the materials nor the purpose of the training. 


Experimental Procedure 


Materials. The assemblies selected for this study had to satisfy the 
primary requirement of being as nearly as possible identical in the 
motions required to perform them. In addition each assembly had to 
satisfy the following criteria: (1) it must differentiate members of a 
group, i.e., have a distribution approaching a normal one, (2) it must

























have a learning curve, so that gradual and distinct improvement oc- 
curs over a period of training, and (3) it must be objectively defined, 
preferably in terms of a time score, so that each subject would have the 
same amount of practice. One assembly, the safety switch, was selected 
from the East Pittsburgh shop of the Westinghouse Electric and Manu- 
facturing Company. It required a relatively simple jig to hold the 
parts being assembled, the use of a suspended screw-driver, and in- 
cluded variously shaped parts. No changes in the assembly were made 
except that a hand screw-driver was used in place of the electrically- 
driven one used in the shop. The second task was then constructed so 
that the sequence of motions was the same as that for the safety switch, 
but so that the parts were of entirely different materials, shape, and ap- 
pearance. 

Since the manipulation of string was an important part of this as- 
sembly, it was named the ‘‘loom.’’ It is considered to have some simi- 
larity to the textile industry. Table 1 shows the differences in the ma- 
terials used in the two assemblies. 








Table 1
Materials Used in Safety Switch and Loom

Safety Switch                      Loom
Porcelain base                     Wooden base
Metal parts: screws, nuts, etc.    Rubber band
Fiber washers                      Leather thongs
                                   Hard rubber screws
                                   Wooden pegs
                                   String





As the loom was being constructed a very careful comparison was 
continually made with the safety switch assembly, and the attempt to 
reproduce the pattern of the whole assembly was considered of the great- 
est importance. Both have the same number of separate assemblies re- 
quiring the hands to reach out to pick up materials from the bins (for 
which the therblig is Transport Empty), bring them to a position over 
the jig (Transport Loaded), put them together (Assemble), and insert 
the assembled product in the base (Assemble). Each requires the screw- 
ing motion in the same place in the series, although one involves the use 
of a tool to perform this screwing motion, while the other requires the 
direct screwing of the parts. Each one requires the lifting up of the 
completed assembly out of the jig, placing it aside, and placing a new 
base in the jig. 

Two methods of motion analysis were used to describe the pattern of 
motions, the Gilbreth method (1), by which motions are described in 













terms of therbligs, or functions, and the method of occupation analysis 
developed in Great Britain (10), by which motions can be described in 
terms of the anatomical unit involved, and in terms of the direction and 
type of movement.²

Two observers made these motion analyses in two ways, first, by 
means of motion pictures of the assemblies performed by the same prac- 
ticed subject, and second, by direct observation of the subject. It was 
felt that the analysis of the motion pictures produced slightly more reli- 
able results, as the direct observation of the subject necessitated the 
subject’s repetition of each portion of the assembly a number of times, 
as if it were a discrete part of the whole assembly. There were no dif- 
ferences in the therblig analyses of the two observers. The few differ- 
ences in the description of the motion generally occurred in defining the 
number of fingers involved in the motion, and in describing the motion 
of the forearm, which afterwards was agreed to be a rotation of the elbow 
and a hinging movement of the wrist. 

The differences in the analyses of the two assemblies were the follow- 
ing: (1) there was an additional operation in the loom at the beginning 
of the assembly, the placement of the wooden base in the jig. Since it 
came at the beginning of the assembly, it did not change the sequence 
of motions. (2) The use of the screw-driver in the safety switch as- 
sembly required a vertical movement of the arm which did not occur in 
the loom assembly, in which the screwing motion was started directly 
and was performed without the use of a tool. (3) The use of the string 
in the loom assembly meant that the names of the therbligs for these 
comparable parts were different, due to the fact that the string was car- 
ried forward and backward (TL in both directions), while in the safety 
switch the hand was empty when it reached forward (TE). The actual 
motions were the same. (4) The description of one motion as an elbow 
rotation and as a wrist rotation occurred in some parts of this analysis; 
the observers agreed, however, that no wrist rotation is possible, and 
that any rotation must start with the elbow. 

The average time necessary for the assembly of the safety switch on 
the first six trials is .0247 hours, and for the loom is .0391 hours, or, 
roughly, 89 and 140 seconds. The difference is undoubtedly attributable 
to the flexibility of the materials used in the loom assembly, as compared 
with the rigidity of those used in the safety switch. The manipulation 
of a rubber band, leather thongs, and string requires more time than a 


2 For example, a movement may be performed by the shoulder, elbow, wrist, or 
finger, may be horizontal or vertical, may be a rotation, a flexion, or a hinging move- 
ment, and may be right or left, or up or down. 






















similar combination of metal and hard fiber parts, although the pattern 
of manipulation is the same.³
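The conversion from decimal hours to seconds quoted in this paragraph can be checked with a short sketch. The function name is illustrative only and is not part of the original study; the two averages are those reported in the text.

```python
SECONDS_PER_HOUR = 3600


def decimal_hours_to_seconds(hours: float) -> float:
    """Convert a time score in decimal hours to seconds."""
    return hours * SECONDS_PER_HOUR


# Average assembly times on the first six trials, as reported in the text.
switch_seconds = decimal_hours_to_seconds(0.0247)
loom_seconds = decimal_hours_to_seconds(0.0391)

print(f"safety switch: {switch_seconds:.1f} s")
print(f"loom: {loom_seconds:.1f} s")
```

The products come to roughly 88.9 and 140.8 seconds, close to the rounded figures of 89 and 140 seconds given in the text.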

Arrangement of Materials. The materials were arranged in as analogous
a way as possible to the actual shop situation (see Figure 1).
Semicircular bins were used so that there was a separate compartment 
for each type of part. All the parts were arranged in order of use in 
the separate compartments, starting with those on the sides; this facilitated
learning of the assembly.

Fig. 1. Arrangement of work-place for assembly of the loom.

Each jig was fastened to a large board
which was clamped to the table top, and the bin was placed in a standard 


3 The reliability of the loom assembly was computed by correlating the averages
of the first 3 assemblies with the averages of the second 3 assemblies on the initial
test for the ‘‘corrected’’ control and experimental groups, resulting in r’s of
+.73 ± .05 and +.82 ± .03, respectively. The r’s between the averages of the
third group of 3 assemblies and the fourth group of 3 assemblies on the final test
were +.80 ± .03 (control) and +.85 ± .03 (experimental). The r’s between the
averages of all 6 assemblies on the initial test and the improvement (difference
between average of first 6 and last 6 assemblies of the loom) shown are +.68 ± .04
(control) and +.73 ± .05 (experimental). The correlation between the safety
switch and loom, using the average of the first 6 assemblies on each, is +.60 ± .06.
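The second figure attached to each coefficient in this footnote appears to be its probable error. The sketch below assumes the conventional formula P.E. = 0.6745 (1 − r²) / √N; the source does not state which formula was used. N = 48 and N = 42 are the sizes of the corrected control and experimental groups, and the function name is hypothetical.

```python
import math


def probable_error_of_r(r: float, n: int) -> float:
    """Conventional probable error of a correlation coefficient (assumed formula)."""
    return 0.6745 * (1 - r**2) / math.sqrt(n)


# Split-half r's on the initial test, as reported in the footnote.
pe_control = probable_error_of_r(0.73, 48)
pe_experimental = probable_error_of_r(0.82, 42)

print(f"control: {pe_control:.3f}")
print(f"experimental: {pe_experimental:.3f}")
```

Under this assumption the two values round to .05 and .03, matching the reported figures. Not every coefficient in the footnote agrees to the last digit, so the formula should be read as a plausible reconstruction only.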













position relative to the jig. A sharpened nail was driven into the edge 
of the board so that it projected about one-fourth to one-half inch and 
permitted rapid cutting of the string used in the loom assembly. The 
screw-driver, used only for the safety switch assembly, was suspended
in a standard position, slightly off center to the right. 

Subjects. All 107 subjects were girls from two vocational high 
schools, to be referred to as the B. and I. schools.⁴ The 74 subjects
from the B. school were selected by the assistant principal as meeting the 
following requirements: between 14 and 18 years of age, I.Q. between 
75 and 95 on the Otis test, right-handed, and white. These requirements 
were chosen arbitrarily because they seemed to describe the group which 
would be most likely to enter routine industrial work. The reason the 
actual distribution of I.Q’s has a greater range than this one is that it 
was later discovered that some of the I.Q’s had been obtained on tests
administered at the school the girl had previously attended. All these 
subjects were retested. 

Aside from girls who were extremely irregular in attendance and 
were consequently not considered as possible subjects by the principal, 
and twenty-four girls who were absent from one of the experimental 
sessions, these subjects form almost the total group in the B. school which 
fell within these limits and came from the sewing trade classes. 

The assignment of the subject to the control or experimental group 
was made by the assistant principal, as she determined the day on which 
the subject was to start and consequently the group that she would be in. 
It seems extremely unlikely that any spurious factor influenced this de- 
cision as last-minute assignments were made frequently because of ab- 
sences. 

The 33 subjects from the I. school were selected from the school files 
by the writer as subjects who would be the closest ‘‘matches’’ for the 
experimental subjects whose records were already known. All were from 
sewing trade classes and were control subjects. 

For each subject the following information was recorded: age to the 
nearest month on the day she took the original test; I.Q. on Otis test given


4 The writer is indebted to Miss Jane Bryce, Principal of the Bellefield and
Irwin Vocational Schools, Pittsburgh, for her cooperation in making this study 
possible. 

The fact that both the B. and the I. schools have the same supervising principal 
and give the same trade training means that the subjects can be considered as
representative of one population. The supervising principal stated that she thought there
was no difference in the training given in the two schools aside from the fact that 
different teachers give it, and that such a difference exists among the teachers in one 
school. The girls in both schools come from widely separated areas, and the decision 
as to which school a girl should attend is made largely on a geographical basis. 



















by the assistant principal within the previous year; months of trade training,
counting one school year as nine months; interval of days between
the original and final test; and trade rating. The trade rating was ob- 
tained from the trade teacher, who was asked to rate each subject from 
her class on a five-point scale in regard to the girl’s skill in using her 
hands and the speed with which she worked in class. 

Testing procedure. The subjects came in groups of five to the room 
in the school in which the experimental materials were set up. Several 
other girls, not eligible as subjects, accompanied each group and assisted 
the experimenter in disassembling the materials. The original and final 
testing sessions each lasted approximately 45 minutes; some of the train- 
ing sessions of the experimental groups took slightly less time than this. 

All the subjects came for the original and final tests, the interval be- 
tween them ranging from 9 to 13 days. The variation was due to the 
fact that the experimental group’s training lasted for five consecutive 
days and consequently week-ends and occasional holidays caused some 
variation in the schedule. Three groups of five subjects each were tested 
in a half-day at the school; the half-day occurred approximately half the
time in the morning and half in the afternoon. Uniform instructions 
were used throughout the whole testing procedure. 

The original test included teaching the subjects the loom assembly, 
by performing it slowly and giving detailed verbal instructions, having 
each subject demonstrate her ability to perform it without error by doing 
so for two trials, and then having the subjects assemble six looms each,⁵


5 The decision to have six looms comprise the original test was made after con- 
sidering a number of factors. It was found that it required approximately 45 min- 
utes to teach the loom assembly to a group and then have them assemble it six times. 
Since this meant that three groups could be handled in a half-day of school time, 
there was a real practical advantage in limiting the time required to 45 minutes. 

The more important consideration, however, was involved in the definition of 
original trial. Rapid improvement from the first assembly occurred in both the loom 
and safety switch assemblies, both for random subjects and for small experimental 
groups at an institution (9 subjects) and a girls’ vocational school in Philadelphia 
(15 subjects). The experimenter, repeating the assembly of the safety switch 30 
times in one sitting, found that the curve of improvement began to level off at the 
twentieth assembly, and that a plateau was appearing in the last 10 of the 30 trials. 

In order to ‘‘obtain (1) a reliable definition of initial performance, and (2) 
evidence for or against the assumption that rate of improveability will be constant 
for two groups of equal ability,’’ a research project was carried out in the experi- 
mental laboratory of the University of Pennsylvania by Miss Kaley Biser, under the 
direction of Dr. Malcolm G. Preston. Materials from the old Stenquist Mechanical 
Aptitude Test were used, the bicycle bell, the rope holder, and the door lock. Women 
subjects were taught how to put these together and then repeated assembly 25 times 
for each object. 


The results were treated in a variety of ways. The data were divided in half 










with a record of the time required for each assembly being made. The 
looms were assembled in groups of three, i.e., the subject worked steadily 
until three were completed. There were always parts left in the bins. 
All the subjects were given the final test, which consisted of assembling 
twelve looms each. The experimenter demonstrated the assembly and 
answered any questions about it, but the subjects were not permitted to 
practice with any of the materials.⁶

Training of experimental subjects. The subjects in the experimental 
groups returned for periods of approximately 45 minutes each on five 
consecutive school days after their original test. They were taught and 
practiced the assembly of the safety switch, repeating it a total of 57 
times.⁷ There was always an interval of at least one and no more than
two days between the last day of training and the final test. 

Motivation. A small prize was offered to the girl in each group of five 
who had the best time record, and another prize to the girl who showed 
the greatest improvement. Each time a group returned to the experi- 
mental room they were told which girls held these positions at that time. 
There was every indication that they were well motivated. 

Method of recording time scores. A running decimal hour watch was 
used to record the time scores. The running time was recorded when 
each assembly was completed, the point of completion being defined as 
the moment the base of the assembled loom or switch touched the table 
top. The actual time for each assembly was then obtained by subtract- 
ing the running time values. The watch was started over again for each 
group of three assemblies. 
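The subtraction step described in this paragraph can be sketched as follows. The readings are hypothetical values chosen for illustration, not data from the study, and the function name is an assumption.

```python
def assembly_times(running_readings):
    """Turn cumulative watch readings into per-assembly times.

    The watch is assumed to start at zero for each group of three
    assemblies, so the first time equals the first reading.
    """
    times = []
    previous = 0.0
    for reading in running_readings:
        times.append(reading - previous)
        previous = reading
    return times


# Hypothetical running readings (decimal hours) for one group of three looms.
readings = [0.0447, 0.0867, 0.1279]
per_assembly = [round(t, 4) for t in assembly_times(readings)]
print(per_assembly)
```

Because the watch was restarted for each group of three, the same function would be applied separately to every group rather than to a subject's whole session.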
and groups equated on the basis of mean score and standard deviation of scores on 
the first trial, the first two trials, the first three trials, the first four trials, and the 
first five trials on each assembly. Learning curves were then computed and plotted 
for these groups on the basis of these variously defined ‘‘initial trials.’’ The re- 
sults ‘‘did not indicate the optimum criterion of initial performance to be used in 
equating groups of subjects for learning experiments. However, the reliability of 
initial performance as the basis of equating groups tends to increase as more trials 
are used to select the groups.’’ The results suggested that using 4/25, or 16% of the
total learning curve was a fairly adequate basis on which to equate the two groups. 

Our definition of ‘‘initial performance’’ is of course very different from four 
trials on one of the Stenquist assemblies, as the latter took considerably less time 
to learn or perform than the assembly of the loom. However, it seems that our use 
of 6/18, or 33% of the total learning curve as the ‘‘initial performance’’ is a
sufficiently liberal definition to permit adequate equation of the two groups.

6 Although the original and final tests each required a period of approximately
45 minutes, the average time spent in actual assembling was 14.2 minutes for the 
original test and 22.5 minutes for the final test. The discrepancy is due to the time 
required to teach and review the assembly and to disassemble and replace parts. 

7 The average time actually spent in assembling the switch was 1.16 hours. The 
discrepancy is due to the time required to replace the parts and to teach the assembly. 
















The extent of variability in the experimenter’s reading was found
to be no greater than ± .0003 hour, or approximately 1 second.⁸ A consistent
method of recording was employed.


Results 


Population from which control subjects came. Since the control sub- 
jects came from two trade schools, while the experimental subjects came 
from one school, it is important to know whether the control subjects can 
be considered as samples from the same population. The method of 
selection differed somewhat, in that those from the I. school were defi- 
nitely selected as the most likely ‘‘matches’’ for certain experimental 
subjects, and are not a random sample of the population falling within 


Table 2
Comparison of B. and I. Control Groups

                            B. (N = 25)   I. (N = 33)   Diff.   P.E.D          Crit. Ratio
I.Q. Mean                       89.0          92.3        3.3    1.02              3.2
Trade Training Mean             11.1           7.1        4.0    1.4               2.8
Trade Rating Mean                3.5           3.3         .2    [illegible]       1.2
Age Mean                        16.4          16.0         .4     .21              1.9
Time Interval Mean              10.5          11.2         .7     .24              2.9
Original Test
  Mean (all 6 trials)          400.3⁹        396.6        3.7    8.4                .44
  Stan. Dev.                    37.3          49.3       12.0    8.2 (S.E.)        1.46
Final Test
  Mean First 3 trials          360.9         359.8        1.1    8.4                .13
  Stan. Dev.                    42.6          53.3       10.7    9.1 (S.E.)        1.2
  Mean Second 3 trials         321.3         323.9        2.6    6.7                .4
  Stan. Dev.                    34.3          39.3        4.9    7.0 (S.E.)         .7
  Mean Third 3 trials          311.8         301.3       10.5    7.0               1.5
  Stan. Dev.                    29.5          31.7        2.2    5.9 (S.E.)         .37
  Mean Fourth 3 trials         299.8         290.9        8.9    5.9               1.5
  Stan. Dev.                    31.7          32.3         .6    6.1 (S.E.)         .09





our arbitrary limits. Those from the B. school, on the other hand, are 
a large proportion of the selected sample of girls from this school who 
fell within our limits. 


8 In order to test the accuracy of the readings, two devices of known speed were
used. On the drum of a constant-speed kymograph were drawn six vertical lines, 
several very close together, and a given point past which the lines moved was de- 
termined. In addition to this, a large electric clock with a sweep second hand was 
used, and the time at which the hand passed the half-hour mark was recorded. Read- 
ings of the six lines on the kymograph drum and of the clock hand were made during 
practice periods on four different days. The degree of variation did not exceed 
± .0003 hour of the actual time of rotation of the kymograph drum, nor ± .0002 hour
of the actual time of rotation of the second hand of the clock. 

9 The time scores in Table 2 are reported in this form because they are easier to
read and understand. Actually, 360.9 means .03609 hours. The data in subsequent 
tables will be reported in the same way. 







The 25 B. and 33 I. subjects are compared on the basis of their mean 
I.Q., age, months of trade training, trade rating, and time interval between
the original and final tests.

Table 2 shows the various characteristics and the scores of the B. and
I. subjects in the control group. Although the critical ratios for the
differences in I.Q., trade training, and time interval approach significance,
the differences themselves are too small in absolute size to be considered
of any practical importance.
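The tabled values are consistent with each critical ratio being the difference between the two means divided by the probable error of that difference. The sketch below assumes that definition, which the source does not spell out, and uses three rows of Table 2. It also restates each ratio in standard-error units using the standard relation P.E. = 0.6745 S.E. for a normal distribution. All names are illustrative.

```python
PE_PER_SE = 0.6745  # probable error as a fraction of the standard error


def critical_ratio(diff: float, probable_error: float) -> float:
    """Difference between means divided by its probable error (assumed definition)."""
    return abs(diff) / probable_error


# (difference, probable error of the difference) from Table 2.
rows = {
    "I.Q.": (3.3, 1.02),
    "Trade training": (4.0, 1.4),
    "Time interval": (0.7, 0.24),
}

ratios = {name: critical_ratio(d, pe) for name, (d, pe) in rows.items()}
for name, cr in ratios.items():
    # The same ratio in standard-error units is smaller by the factor 0.6745.
    print(f"{name}: {cr:.2f} P.E. units, {cr * PE_PER_SE:.2f} S.E. units")
```

The computed ratios of about 3.24, 2.86, and 2.92 agree with the tabled 3.2, 2.8, and 2.9 if the table truncates rather than rounds.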

Comparison of total control and experimental groups. Table 3 shows 
the characteristics of the total control (58 subjects) and experimental 
(49 subjects) groups, and their performance on the original test. 








Table 3
Comparison of the Total Experimental and Control Groups

                           Exper.           Cont.                             Crit.
                           (N = 49)         (N = 58)    Diff.   P.E.D         Ratio
I.Q. Mean                   89.2 (N = 44)     90.9       1.7     .9            1.9
Age Mean                    16.8              16.1        .7     .16           4.4
Trade Training Mean         12.3               9.9       2.4     .9            2.6
Trade Rating Mean            3.4               3.4       0
Time Interval Mean          10.9              10.9       0
Original Test
  Mean (all 6 trials)      398.7             398.2        .5    7.3             .07
  Stan. Dev.                55.5              44.1      11.4    6.7 (S.E.)     1.7





It is apparent that the two groups differ significantly in age, and have 
unequal variability in score on the original test. Any comparison based 
on these total groups would be subject to question particularly because 
of the age difference. Consequently, it seemed necessary to equate the 
groups more closely by dropping out certain subjects. Definite criteria 
were carefully used in order to prevent the operation of any biases of 
the experimenter. The first consideration was to decrease the difference 
in age between the two groups, and if there was any choice, the slight dif- 
ferences in I.Q. and in variability of the two groups were considered. 
An attempt was made to preserve the general form of the distributions 
in so far as possible.¹⁰

10 The criterion as to what limits should be permitted in an attempt to equate 
the two groups depends upon the degree of correlation between initial score and the 
various factors. Coefficients of correlation were computed for the ‘‘corrected’’ con- 
trol and experimental groups separately. The r’s of initial score on the loom with 
I.Q., trade rating, time interval, and of trade training with trade rating, I.Q. with
age, final score with time interval, were found to be insignificant for both groups.
The r’s of initial score with age (−.55 ± .07 for experimental and −.35 ± .08 for
the control group), and of trade training with age (+.56 ± .07 for experimental and
+.58 ± .06 for the control group) were significant. The r of initial score with
trade training was significant for the experimental group (— .38+ .09) but in- 
















Table 4 shows the characteristics of the ‘‘corrected’’ control and ex- 
perimental groups. The difference in age still produces a larger critical 
ratio than would be desirable. However, it appeared that the optimum 
equation of the two groups had been reached. 


Table 4
Comparison of the “Corrected” Control and Experimental Groups

                           Exper.      Cont.                               Crit.
                           (N = 42)    (N = 48)    Diff.   P.E.D           Ratio
I.Q. Mean                   89.6        90.1         .5     .94             .5
Age Mean                    16.6        16.3         .3     .15            2.0
Trade Training Mean         11.1        10.5         .6    1.0              .6
Trade Rating Mean            3.5         3.4         .1     .14            [illegible]
Time Interval Mean          11.0        10.8         .3     .15            1.3
Original Test
  Mean (all 6 trials)      399.1       399.5         .4    6.7              .06
  Stan. Dev.                47.1        45.0        2.1    7.0 (S.E.)       .3





Table 5 shows the average time scores made on each trial by the two 
groups. The variation in the size of the critical ratios seems to follow a 
certain pattern in each group of three trials on the final test. The actual 
difference and the critical ratio decrease in size from the first to the 
third trial in each group. This might be interpreted as meaning that 
the experimental group showed the effects of its training on the safety 
switch in its more rapid handling of the loom assembly at the beginning 
of each group of three trials. However, the size of the critical ratios 
gradually increases through successive groups of three trials; the longer 
the practice on the loom continues, the more the two groups become sepa- 
rated. This steady increase suggests very strongly that if the test had 
been continued, an even greater difference might have occurred, and 
gives added weight to the actual size of the ratios. 

It seems most plausible to attribute the slow but steady improvement 
of the experimental group over the control group to something akin to 
interference or negative transfer. That is, the superiority of the experi- 
mental group did not show up on the first assemblies of the final test 
because getting accustomed to the loom assembly after having had train- 
ing on the safety switch was a much different problem for them than mere 
significant for the control group (— .24+ .09). None of the differences between 


comparable r’s for the control and experimental groups were significant when the z 
test was used. 

These r’s suggest that the fairly close equation of the ‘‘corrected’’ groups in 
regard to all the factors was not completely necessary, since many of the factors 
were not significantly related to the performance on the loom. 

Use of the partial correlation technique showed that holding either age or trade 
training constant lowers the correlation of the other with score; age seems to con- 
tribute more to the correlation with score than trade training. 
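
The partial correlation technique mentioned here can be sketched as follows. This is only an illustration of the standard first-order formula, not the author's own computation: the function name `partial_r` and the three input correlations are hypothetical, since the underlying r's are not reported in this passage.

```python
import math

def partial_r(r_xy: float, r_xz: float, r_yz: float) -> float:
    """First-order partial correlation of x and y with z held constant."""
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))

# Hypothetical inputs (not from the article):
# x = loom score, y = age, z = trade training.
example = partial_r(0.5, 0.5, 0.5)
print(round(example, 3))
```

With these made-up inputs the correlation of .5 between x and y falls once z is held constant, which is the kind of lowering the passage describes.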










Table 5

The Learning Curves on the Loom for the “Corrected”
Control and Experimental Groups

             Control    Exper.
Trial        (N-48)     (N-42)     Diff.    P.E.D    Crit. Ratio
Initial Test
   1          446.6      460.7    †14.1     10.5        1.41
   2          420.2      427.6     †7.4      9.4         .79
   3          412.5      409.0      3.5      9.0         .39
   4          373.9      374.0      †.1      7.4         .01
   5          376.9      367.1      9.8      8.0        1.22
   6          367.8      359.2      8.6      7.4        1.16
Final Test
   1          383.4      368.6     14.8      9.8        1.51
   2          361.3      346.6     14.7      8.8        1.67
   3          337.9      331.8      6.1      7.6         .80
   4          332.8      311.5     21.3      6.4        3.33
   5          322.8      319.6      3.2      7.1         .45
   6          320.0      314.1      5.9      7.7         .77
   7          311.5      298.7     12.8      5.7        2.24
   8          300.8      291.3      9.5      5.0        1.90
   9          304.1*     294.2*     9.9      7.7        1.28
  10          294.4      276.9     17.5      5.5        3.18
  11          299.6      282.6     17.0      6.4        2.65
  12          293.1      283.8      9.3      6.2        1.50

* Indicates those means which were seriously distorted by the presence of one
extreme score at the top of the distribution.
† Indicates the differences in favor of the control group.


repetition of the loom assembly was for the control group. This was also 
indicated in comments of the subjects. 

Table 6 is a condensation of the data of Table 5, and reports the 
average scores for each group of three assemblies. The steady increase


Table 6

Smoothed Learning Curves of “Corrected” Control and
Experimental Groups on Final Loom Assembly

                     Cont.     Exper.
                     (N-48)    (N-42)    Diff.    P.E.D    Crit. Ratio
First 3 trials
  Mean               360.6     348.2     12.4      7.0        1.77
  Stan. Dev.          44.8      52.6
Second 3 trials
  Mean               324.6     317.1      7.5      6.0        1.25
  Stan. Dev.          38.3      44.4
Third 3 trials
  Mean               308.3     294.3     14.0      5.1        2.74
  Stan. Dev.          31.3      37.9
Fourth 3 trials
  Mean               295.2     281.2     14.0      4.9        2.86
  Stan. Dev.          34.4      33.4










in the size of the critical ratios is also apparent here and gives added 
weight to their actual size. The largest critical ratio, 2.86, means that 
there are 3 chances in 100 that as great a difference as the one occurring 
here could have occurred by chance. The actual size of the difference, 





Graph I. Learning curves on loom for corrected control and experimental groups.
(Separate trials and smooth curve plotted for the control and experimental groups;
horizontal axis: trials of the original test and final test.)


.0014 hour, which is roughly equivalent to a little over 5 seconds, is so
small that its practical significance is very questionable. The differences 
between the groups, however, can be expected to be small because of the 
descending learning curve. The important question is whether the dif- 










ference between the control and experimental groups would continue to 
increase with additional trials; if it did, even a small difference at this
point in the curves would be significant. An extension of the experiment 
would be necessary in order to answer this question. 

Graph I shows the learning curves of the ‘‘corrected’’ experimental 
and control groups from Table 5, and also the smoothed curves drawn 
from the data in Table 6. It is apparent that the two groups which were 
carefully equated on initial performance are showing increasing differ- 
ence in performance on the final test. 

When the gross improvement from initial to final test, calculated by
subtracting the average for the last 3 trials on the final test from the
average of all 6 trials on the initial test, is compared for the two groups,
the critical ratio, D/P.E.D, is 3.13, which means that there are 2 chances
in 100 that a difference of this size could have occurred by chance. The 
actual difference in the means of the control and experimental groups is 
.00142 hour, practically the same as when the means of the final scores are 
compared. 
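
The step from a critical ratio to "chances in 100" can be sketched as follows. This is a reading of the reported figures, not a procedure stated in the article: it assumes that the ratios are in probable-error units (one P.E. = .6745 standard error) and that the chances quoted are one-tailed areas of the normal curve. The function name `chance_from_pe_ratio` is hypothetical.

```python
import math

PE_TO_SE = 0.6745  # one probable error equals 0.6745 standard errors

def chance_from_pe_ratio(crit_ratio: float) -> float:
    """One-tailed normal probability for a difference given in P.E. units."""
    z = crit_ratio * PE_TO_SE                   # convert D / P.E. to D / S.E.
    return 0.5 * math.erfc(z / math.sqrt(2))    # upper-tail area of the normal curve

for ratio in (2.86, 3.13):                      # ratios reported in the text
    print(ratio, round(chance_from_pe_ratio(ratio), 3))
```

Under these assumptions the two ratios give probabilities a little under .03 and a little under .02, in line with the "3 chances in 100" and "2 chances in 100" quoted in the text.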

Matched pair comparisons. In addition to group comparisons, it was 
possible to match 19 pairs of subjects, using the criteria shown in Tables 
7 and 8. Each control subject had to possess characteristics none of 
which differed from those of her ‘‘matched’’ experimental subject by an 
amount greater than the limits which were set. 


Table 7
Limits Used in “Matching” and Standard Errors of Estimate

                                    S.E. of Estimate from Correlations
                  Limits Used       Cont. (N-48)      Exper. (N-42)
Age               7 months          12 months         19.8 months
I.Q.              5 points          6.2 points        6.6 points
Trade training    3 months          6.9 months        6.5 months
Trade rating      2 points          .92 points        1.01 points
Initial score     .0010 hour





That these limits are actually very conservative is shown by the sizes 
of the standard errors of estimate which were determined from the corre- 
lations of score with the various factors. In only one case, trade rating, 
is the limit used in matching as large or larger than the standard error 
of estimate. The small range of five points seems to account for this. 

In regard to the initial score, a criterion that could be satisfactorily 
used for deciding the limits for matching could be the standard error of 
the difference between the averages of the control and experimental 
groups for the first six trials on the loom. This standard error is .000098 
hour. 

























Table 8
Differences in Matched Pairs

                         Cont.     Exper.                      Crit.
                         (N-19)    (N-19)    Diff.    P.E.D    Ratio
I.Q. Mean                 91.4      90.5      .9      1.05      .86
Age Mean                  16.5      16.5      0
Trade Training Mean       11.0      11.0      0
Trade Rating Mean          3.3       3.5      .2       .2      1.00
Time Interval Mean        11.1      10.9      .2       .3       .67





For each trial, the actual difference between the scores of the control 
and experimental subjects in a pair was obtained. The standard error 
of these differences was then computed. Table 9 shows the differences 
and the interpretation of their significance according to Student’s tables 
(p). Values less than .01 are reported as .01. 








Table 9
Learning Curves on Loom for Matched Pairs

             Control    Exper.
             (N-19)     (N-19)
Trial         Mean       Mean      Diff.     σD      D/σD      p
Initial Test
   1          444.8      444.6       .2      8.1      .02      --
   2          415.8      427.9     12.1     14.6      .83     .21
   3          415.8      416.0       .2     14.1      .01      --
   4          371.4      369.4      2.0      9.8      .20     .42
   5          383.1      362.1     21.0     11.0     1.90     .04
   6          361.5      361.6       .1     12.3      .01      --
Final Test
   1          386.8      353.3     33.5     17.1     1.96     .03
   2          355.7      349.0      6.7     12.4      .54     .30
   3          347.5      321.4     26.1     14.5     1.80     .04
   4          333.8      302.8     31.0     11.7     2.65     .01
   5          331.3      322.2      9.1      6.4     1.42     .09
   6          315.3      307.3      8.0     10.4      .77     .22
   7          320.8      291.4     29.4     10.4     2.82     .01
   8          311.9      287.5     24.4     10.5     2.32     .01
   9          302.0      287.9     14.1      9.0     1.57     .07
  10          297.3      263.6     33.7     11.2     3.01     .01
  11          304.6      275.8     28.8     13.3     2.16     .02
  12          299.8      279.4     20.4     12.0     1.70     .05








The same effect as that observed in the group comparison is apparent 
here. The size of the ratios shows a steady increase throughout the final 
test. This also appears when the averages for each group of 3 trials are 
compared, as shown in Table 10. 
















Table 10
Smoothed Learning Curves of Matched Pairs on Final Loom Assembly

                     Cont.     Exper.
                     (N-19)    (N-19)
                     Mean      Mean      Diff.     σD      D/σD      p
First 3 trials       363.4     341.2     22.2     11.6     1.92     .03
Second 3 trials      326.8     307.6     19.2      9.9     1.94     .03
Third 3 trials       311.6     299.1     12.5      8.4     1.49     .08
Fourth 3 trials      300.5     272.9     27.6     10.3     2.69     .01





According to Student’s tables (11, 488), there are 7 chances in 1000 
that a difference as great as the one obtained between the 19 control and 
19 experimental subjects on the last 3 assemblies of the loom could have 
occurred by chance. This means that we can feel fairly confident that 
there is a real difference between the two groups of subjects. Since they 
were carefully matched for the various factors which seemed likely to 
influence performance and improvement, we can conclude that the dif- 
ference is due to the experimental conditions, that is, the training given 
to the experimental subjects. 
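
The probability read from Student's tables can be approximated numerically. The sketch below is an illustration rather than the author's procedure: it assumes a one-tailed test with 18 degrees of freedom (19 pairs less one), and the helper names `t_pdf` and `one_tailed_p` are hypothetical. It integrates the t density by Simpson's rule so that only the standard library is needed.

```python
import math

def t_pdf(t: float, df: int) -> float:
    """Density of Student's t distribution with df degrees of freedom."""
    coef = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return coef * (1 + t * t / df) ** (-(df + 1) / 2)

def one_tailed_p(t: float, df: int, steps: int = 2000) -> float:
    """Upper-tail probability for t > 0, by Simpson's rule on [0, t] (steps must be even)."""
    h = t / steps
    total = t_pdf(0.0, df) + t_pdf(t, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return 0.5 - total * h / 3

# Diff. and sigma_D for the fourth group of 3 trials in Table 10.
t_ratio = 27.6 / 10.3
p = one_tailed_p(t_ratio, df=19 - 1)
print(round(t_ratio, 2), round(p, 4))
```

Under these assumptions the probability comes out a little under .01, which agrees with the 7 chances in 1000 quoted from the tables.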

Graph II shows the learning curves for the matched pairs.


Discussion and Conclusion 


The present study has offered a critical approach to the problem of 
transfer of training in motor learning. An objective method of motion 
analysis has been used to establish the similarity between the two tasks, 
which are nearly identical in pattern of motions but differ greatly in the 
materials used. Trade school girls served as subjects. The results will 
be discussed in terms of their practical and theoretical importance. 

The fact of transfer of training between these two tasks has been 
demonstrated when the training consisted of mere repetition of the task, 
with no training in general principles of work nor any insight into the 
purpose of the experiment being given to the subject. Such transfer has 
not heretofore been shown in any study using materials which could be 
considered analogous to the industrial situation.

Although the difference between the performance of the control and 
experimental groups on the final test is statistically significant, there can 
be some question about the practical significance of such a difference. 
The actual size of the difference is .0014 hour, or 5 seconds, when the
“corrected” control and experimental groups are compared (see Table
6), and is .00276 hour, or 10 seconds, when the 19 matched pairs are
compared (see Table 10). In terms of percentage, these differences are 5%
and 10% respectively of the average assembly time made by the experi- 
mental subjects on the last three assemblies of the loom. 
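
The conversions in this paragraph can be checked with a short sketch. It assumes that the table scores are expressed in ten-thousandths of an hour, which is inferred from the statement that a difference of 14.0 corresponds to .0014 hour; the name `summarize` is hypothetical.

```python
SECONDS_PER_HOUR = 3600
UNIT_IN_HOURS = 0.0001   # assumption: table scores are in ten-thousandths of an hour

def summarize(diff_units: float, exper_mean_units: float):
    """Return the difference in seconds and as a percentage of the experimental mean."""
    seconds = diff_units * UNIT_IN_HOURS * SECONDS_PER_HOUR
    percent = 100 * diff_units / exper_mean_units
    return seconds, percent

group = summarize(14.0, 281.2)   # "corrected" groups, Table 6, fourth 3 trials
pairs = summarize(27.6, 272.9)   # matched pairs, Table 10, fourth 3 trials
print([round(v, 1) for v in group], [round(v, 1) for v in pairs])
```

The two differences come to roughly 5 and 10 seconds, and to roughly 5% and 10% of the experimental group's mean time on the last three assemblies.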































Offhand, one might say that a difference of 5 or 10% was of no prac- 
tical importance. However, the fact that this difference appears when 
the subjects are on a descending learning curve means that it would 























Graph II. Learning curves on loom for matched pairs.
(Separate trials and smooth curve plotted for the control and experimental groups;
horizontal axis: trials of the original test and final test.)


tend to be small. The proper way to evaluate its significance would be 
to compare the length of time required by the experimental subjects to 
reach this point on the learning curve of the loom, and the length of 
time required by the control subjects to reach the same point. Unfortu- 










nately, this cannot be done with our data as the learning curve on the 
loom was not carried out for a sufficient length of time by the control 
group. 

Translated into practical terms, a saving of 10% would mean a saving 
of 48 minutes in an 8-hour working day, or that, if approximately .0273 
hour or 100 seconds were required to assemble one loom, approximately 
27 more looms could be assembled by the ‘‘trained’’ subjects as compared 
with the ‘‘control’’ subjects, a total of 293 looms a day as compared with 
266. If this difference in rate of working persisted, we would have an 
objective criterion of whether the 1.16 hours spent in assembling the 
safety switch produced a practical saving. We are not justified, how- 
ever, in assuming a certain course of the learning curve beyond the point 
at which our data stop.¹¹
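
The arithmetic behind these figures can be sketched as follows. The sketch takes the 10% as a percentage of the trained subjects' time, as defined earlier in the discussion, so the control time per loom is treated as 10% longer; the variable names are hypothetical.

```python
HOURS_PER_DAY = 8
TRAINED_TIME_HOURS = 0.0273   # time per loom for the trained subjects, from the text
SAVING = 0.10                 # the 10% difference found for the matched pairs

trained_per_day = HOURS_PER_DAY / TRAINED_TIME_HOURS
# Control time per loom taken as 10% longer than the trained time.
control_per_day = HOURS_PER_DAY / (TRAINED_TIME_HOURS * (1 + SAVING))
minutes_saved = HOURS_PER_DAY * 60 * SAVING

print(round(trained_per_day), round(control_per_day), round(minutes_saved))
```

This reproduces the 293 and 266 looms a day and the 48 minutes given in the text.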

The chief theoretical significance of this study is that transfer of 
training has been demonstrated between two tasks representative of in- 
dustrial work when the only training given the subjects was the repetition 
of another task. This is in contradiction to Cox’s results (3), as he 
found no significant degree of transfer when the subjects merely repeated 
another assembly. No mention of general principles of work was made 
to the subjects in this study; the only instructions were directions for the
two assemblies, and statements to provide motivation. 

There are several possible explanations of why the transfer occurred. 
One is that the subjects recognized the fundamental identity of the two 
assemblies, and since the identity was apparent to them, profited much 
more from the training than if the identity had not been apparent. 
Bayles (2) would support the point of view that the identity has to be 
apparent to the subject if transfer is to occur. The writer, who con- 
ducted all the sessions with the subjects, does not believe this occurred 
for any subjects, because none of them indicated in her comments that 
she had any insight into the similarity of the two tasks. Numerous 
spontaneous comments were made when the subjects returned for the 
final test after the intervening practice, but they talked of the two as- 
semblies as two very different activities, and frequently expressed a 
preference for one as against the other. 

Another explanation of the occurrence of the transfer is to be found 
in the theory of identical elements. If the subjects did not comprehend 
the identity, and if they were not taught any general principles of work, 
it seems necessary to explain the transfer in terms of the elements which 
were identical to both assemblies. The identical elements can be defined 
in several different ways, as identical motions in the sub-assemblies, as 


11 The learning curves show a steady separation in Graph I, but we know that this
cannot continue indefinitely, as a physiological limit would be reached at some point.







identical patterns in the whole assemblies, or as identical work situations. 

With the few exceptions listed above in the description of the
materials, the various sub-assemblies of which the whole assemblies were
composed were identical in the motions involved, i.e., they required the
same motions for picking up the materials, putting them together, etc.
The act of screwing was the same although a tool was used in the safety
switch and not in the loom.

The patterns of the total assemblies were also nearly identical. The 
difference in total time required, 89 seconds for the safety switch and 
140 seconds for the loom at the beginning of practice, is attributed to the 
greater flexibility of the materials of the loom. Although the safety 
switch assembly was more ‘‘compressed’’ than that of the loom, it con- 
sisted of nearly the same pattern of motions. This pattern included 
reaching to the bins in a certain order, performing the screwing in 
proper sequence, etc. 

It is also possible that the subject’s total adjustment to the work 
situation was considerably improved by the training on the safety switch 
and that this can account for some of the transfer which the experimental 
subjects showed. By ‘‘total adjustment’’ is meant the way in which the 
subjects became accustomed to the work situation. They learned, for ex- 
ample, that the parts to be put together were always in the same places, 
that the sequence of bins had to be followed exactly, and that the com- 
pleted assembly could be hurriedly placed on the table top as they were 
reaching for another base. Although the control subjects were given the 
same directions, and had to follow the same sequence of bins, they were 
not nearly so accustomed to the work situation, and their manipulation 
of the parts was not in the same stage of automaticity. In the opinion 
of the writer, this total adjustment to the work situation also contributed 
to the transfer. 

We can conclude, then, that the transfer which has occurred is prob- 
ably due to the great similarity of the two assemblies. Whether it is 
due to the near identity of the specific motions, or of the general pattern 
of motions, or of the total work situation, or to a combination of these 
factors, we cannot tell from the present data. 


Bibliography 


1. Barnes, R. M. Motion and time study. New York: John Wiley and Sons, 1940. 
Pp. v + 382. 

2. Bayles, E. E. An unemphasized factor in current theories regarding the transfer 
of training. J. Educ. Psychol., 1936, 27, 425-430. 

3. Cox, J. W. Manual skill: its organization and development. Cambridge: Uni- 
versity Press, 1934. Pp. vii + 247. 










4. Crafts, L. W. Transfer as related to number of common elements. J. Gen.
Psychol., 1935, 13, 147-158.
5. Freeman, G. L. Studies in the psycho-physiology of transfer: I. The problem of
identical elements. J. Exp. Psychol., 1937, 21, 521-532.
6. James, H. E. O. The transfer of training. Brit. J. Psychol., 1930, 20, 322-332.
7. Koepke, C. A. A job analysis survey. Occupations, 1934, 12, 15-35.
8. Langdon, J. N. An experimental study of certain forms of manual dexterity.
Indus. Health Res. Bd., 1932, Report No. 66.
9. Langdon, J. N., and Yates, E. M. An experimental investigation into transfer of
training in skilled performances. Brit. J. Psychol., 1928, 18, 411-424.
10. Occupational Analysis. The study of aptitudes and attainments necessary for
success in different kinds of employment. Nat'l Inst. Indus. Psychol., Report
No. 1, 1926. London: Offices of the Institute.
11. Peters, C. C., and VanVoorhis, W. R. Statistical procedures and their mathematical
bases. New York: McGraw-Hill, 1940. Pp. v + 510.
12. Shaw, A. G. Some developments in motion study training, Seventh International
Management Congress. Baltimore, Md.: Waverly Press, 1938, 124-128.
13. Woodworth, R. S. Experimental psychology. New York: Henry Holt, 1938.
Pp. iii + 860.













The Use of the Minnesota Rate of Manipulation and the 
O’Connor Finger Dexterity Tests in the Selection 
of Package Wrappers 


Edwin E. Ghiselli 


University of California 


It is standard procedure in many counseling agencies to use the Min- 
nesota Rate of Manipulation (placing and turning) and the O’Connor 
Finger Dexterity Tests for the selection of persons for package wrapping 
positions. Recent results from extensive studies by Blum and Candee,¹
however, have indicated that these instruments are of little or no value
for this purpose. The highest validity coefficient they report is one of
.37 for the placing test. As nearly as can be judged, the average of their
validity coefficients for the placing and turning tests is of the order of .20,
while that of the O’Connor test is not significantly different from zero.
Furthermore, a multiple validity coefficient for these tests very probably
would be less than .40. Investigations carried out by the U. S. Employment
Service² have yielded similar results for the Minnesota tests.

In spite of these rather conclusive results, a local counseling agency 
asked the writer to make some further checks on the usefulness of these 
tests in the selection of package wrappers. In order to do this, ratings 
were obtained of the work of 42 seasonal wrappers at a department store, 
all of whom had taken the Minnesota and O’Connor tests. The ratings 
were of quality and quantity of production as judged by supervisors. 
The coefficient of correlation between these two sets of ratings was .80, 
which indicated adequate reliability. The total of these ratings was 
used as the criterion of proficiency. The validity coefficients of the
placing and turning tests were respectively −.10 and −.02, while that of
the finger dexterity test was +.02.

These findings confirm the results obtained by previous investigators 
showing that the Minnesota placing and turning tests, and the O’Connor 
Finger Dexterity Test are of little practical value in the selection of 
package wrappers. Clearly, then, further research on tests for the selec- 
tion of persons for this work should employ other types of items. Both 


1 Blum, M., & Candee, B. The selection of department store packers and wrappers
with the aid of certain psychological tests. J. appl. Psychol., 1941, 25, 76-85 & 291-299.

2 Stead, W. H., Shartle, C. L., et al. Occupational Counseling Techniques. New
York: American Book Co., 1940.




the work of Blum and Candee and the U. S. Employment Service indicate 
that the name and number comparison items of the Minnesota Vocational 
Test for Clerical Workers will be useful, and the findings of the latter
group of workers in addition suggest that tapping and dotting items such
as those included in the MacQuarrie Test for Mechanical Ability and
paper and pencil spatial relations items might be helpful. Certainly if
motor and dexterity tests are to be employed, they should be very dif- 
ferent in nature from the placing, turning, and finger dexterity tests. 

























Selection of Sewing Machine Operators 


Milton L. Blum 
College of the City of New York and Vocational Service for Juniors 


A glove manufacturing concern faced with the problem of high labor
turnover became interested in the possibility that selecting employees
by test might reduce that turnover.

A study of the various jobs indicated that a vast majority of the 
assemblers used electrically driven sewing machines with slight varia- 
tions in the needle attachment. Four main divisions comprise the jobs 
at the factory, namely: fourchetting, thumb inserting, closing, and pointing. 

Arrangements were made to test fifty employees at the factory, 
twenty-five of whom were to be in the highest quartile of earnings and 
twenty-five of whom were to be in the lowest quartile. As salaries are 
based upon a piece rate system, these two groups are therefore representa- 
tive of the most and least productive employees at the factory. These 
employees were to be sent in alphabetical order to a specially designated 
testing space apart from the factory so as to avoid any suspicion as to 
why only certain people were tested. The group was informed that only 
a small number from each department were selected at random to take 
part in the experiment. They were given a statement from the firm 
informing them that this was an experiment to devise a sewing machine 
test conducted by the New York State Employment Service and that the 
test scores were to be the private information of the Service and were not 
to be made known to the firm. 


Selection of Tests 


It was necessary in considering the tests to be selected to remember 
that many of the subjects were well on in years and therefore could not 
be subjected to a long period of pencil and paper testing. A survey of 
the available tests resulted in the adoption of the MacQuarrie Test for 
Mechanical Ability. In addition, an original sewing machine test sample 
that included the job aspects to a high degree was administered. 

The MacQuarrie test is widely known and therefore will not be de- 
scribed except to mention that it is a paper and pencil group test that 
lasts about twenty minutes. It has seven sub-tests and measures such 
abilities as tapping, dotting, tracing, and tridimensional space perception. 



The original work sample, hereafter referred to as the Blum Sewing
Machine Test, was devised after a job analysis indicated that the basic
elements of the job might best be measured by such a performance test
rather than a pencil and paper test. Samples of the test are illustrated.























Blum Sewing Machine Test, Tests A and B (reduced one-half)


The person taking the test is asked to trace on the line in Test A and 
between the lines in Test B. The line is zig-zagged to resemble changes 
in angles required in actual sewing jobs. The person is not to sew with 
thread. The needle of the sewing machine makes a perforated line and 



















performance is measured by the time taken to complete the task and by a
record of errors, counted as the number of holes away from the line.
Each subject is required to take Test A, then Test B, and finally to repeat
each of the tests a second time.

The intercorrelations¹ obtained are reported in Table 1.








Table 1
Intercorrelations on Blum Sewing Machine Test

Total Time                                                     Correlation
Test A and Test B ...........................................     +.71
Test A and Test A repeated ..................................     +.74
Test B and Test B repeated ..................................     +.82
Test A repeated and Test B repeated .........................     +.84
Test A and Test A repeated and Test B and Test B repeated ...     +.83





Test “A” is more difficult than test “B” because the needle perforations
must be made on the line rather than in the space between the lines
as in test “B.” Test “A” undoubtedly measures, in addition to a fundamental
ability, performance before an adjustment to the specific task
can be made. It is likely, therefore, that the correlations in the eighties
are more truly representative of steady performance. However, there
are many reasons why performance before initial adjustment should also
be measured. One of the most important is that this ability may actually
be a measure of the speed at which an employee can be trained
whereas the later performance may more closely be related to ultimate
maximum production.


Criteria Selected 


Two criteria of success were selected. The first was the production 
record which was based upon piece rate earnings of the individuals in- 
volved. The second was the employee ratings by supervisors. With 
reference to production rating, each of the four departments studied was 
found to contribute equally to each quartile. The department average 
earnings were 44¢, 47¢, 45¢ and 45¢ respectively. The indications 
are that the piece rate systems are rather similar regardless of the depart- 
ment insofar as this experiment is concerned. 

The supervisors' ratings were found to be distributed as follows:


1 It is noted that computing correlation coefficients from a distribution composed
of two extreme groups is statistically questionable, as such a distribution tends to
raise the coefficient artificially. These coefficients are reported merely to indicate the
approximate relationships existing between the tests rather than to present the numerical
values of the relationships.



















Table 2 
Supervisors' Ratings of Employees
Employees 
Supervisors and Department Q1 of earnings Q4 of earnings 
= mee ie si. =p 8 
(Laura) ourchetting....... 7 .. = 
aay & Kitty) a Fie ed aka dh 5 2 3 
Zelly) thumb inserting. . 3 2 5 
(Helen) CR S .3.3 ee 1 
Sibars wwiateeteseséuainedl a => a a Se ae | 





Examination of Table 2 reveals that the more productive employees
(Q1) tend to receive better ratings than the less productive employees.
The directions to the supervisors were not too definite and specifically
avoided mention of production as such. They were merely asked to
assign a letter to each employee as follows:


A—a very good worker—wish all were like her 
B—a good worker 

C—an average worker 

D—a poor worker 

E—an impossible employee 


No further comments were made because it was felt desirable not
to put too much pressure on the supervisors. The explanation given
to the supervisors for wanting the ratings was that they were a necessary
part of the experiment and in no way were to be a reflection on the
supervisor or her ability.


Results 


1. MacQuarrie and Production Records 


An analysis of the distribution of scores indicates that the best possible
cutoff on the MacQuarrie is a total score of 60 or better. If this is set as the
critical score, 92% of the group that is lowest in salary earnings would be
eliminated. However, 62% of the employees who are in the highest
earning quartile would also be eliminated. The difference between
these two percentages is not statistically reliable (D/σd = 2.7), and it
is in any case questionable, in view of the present employment difficulties,
whether a firm could afford to adopt a critical score that would
eliminate such a high percentage of its best workers.
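
The ratio quoted for these two percentages can be reproduced with the usual formula for the standard error of a difference between two independent proportions. The sketch assumes 25 employees in each quartile, as described in the testing arrangements, and an unpooled standard error; the article does not state which formula was used, and the function name is hypothetical.

```python
import math

def ratio_of_percent_difference(p1: float, p2: float, n1: int, n2: int) -> float:
    """Difference between two proportions divided by its (unpooled) standard error."""
    se = math.sqrt(p1 * (1 - p1) / n1 + p2 * (1 - p2) / n2)
    return (p1 - p2) / se

# 92% of the low-earning group and 62% of the high-earning group eliminated.
cr = ratio_of_percent_difference(0.92, 0.62, 25, 25)
print(round(cr, 1))   # 2.7
```

The result matches the reported D/σd of 2.7, which falls just short of the ratio of 3 that the article treats as statistically reliable.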

Analysis of the various sub-tests reveals the following: If on the trac- 
ing test, a critical score of 30 or better is demanded, it is found that 40% 



















of the good workers would have been eliminated while 76% of the poor
workers would not be included. This difference is statistically reliable
(D/σd = 3). The tapping and dotting tests do not differentiate between
the groups at all. The copying test and the block location test tend to
discriminate between the groups but the percentage of the good group
eliminated is too large in view of present employment difficulties.


2. MacQuarrie and Supervisors' Ratings


Eighteen employees in the fourth quartile received a “C” rating.
Their average performance on the MacQuarrie was 45.1. Eighteen employees
in the first quartile received an “A” rating. Their average score
on this test was 48.7. The difference between these two means is not
statistically reliable (D/σd = 1.4). Similar results are also obtained upon
analysis of the various sub-tests. There is a slight tendency for the
workers rated “A” to be superior to the “C” workers but in no instance
do the differences approach statistical reliability.


3. Blum Sewing Machine Test and Production Records 


The relation existing between the production records and the scores
on the sewing machine test is positive: faster time accompanies greater
production. The correlation between total time of tests “A” and “A
repeated” and production record is −.40 (±.12). The correlation between
the total time of tests “B” and “B repeated” and production record is
−.33. The correlation between total time on all four tests and production
is −.42.
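
The ±.12 attached to the first coefficient is consistent with the classical standard error of a correlation coefficient, (1 − r²)/√N, taking N as the 50 employees tested. That reading is an assumption; the article does not say how the figure was obtained, and the function name `se_of_r` is hypothetical.

```python
import math

def se_of_r(r: float, n: int) -> float:
    """Classical large-sample standard error of a correlation coefficient."""
    return (1 - r * r) / math.sqrt(n)

print(round(se_of_r(-0.40, 50), 2))   # 0.12
```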

Examination of the scatter diagram allows for the setting up of any 
of the critical scores suggested in Table 3 depending upon the availability 
of applicants and the ability possessed by these individuals. 








Table 3
Critical Score and Percentage of Groups Eliminated

Critical      % of Poor Group      % of Good Group
 Score          Eliminated           Eliminated
 260+              20%                   0%
 240+               0                    4%
 220+              56%                  20%
 200+              72%                  28%




4. Blum Sewing Machine Test and Supervisors' Ratings


The average score (total of all four tests) of the eighteen employees
rated “A” by the supervisors was 196. The eighteen rated “C” in the
Q4 group averaged 244. Here the difference is statistically reliable
(D/σd = 3). The average score of the “A” workers is statistically different
from the average score of those who are rated “C.” The sewing














machine test results differentiate the good from the mediocre operators 
on the basis of either of the two criteria used. 


Aptitude or Achievement? 


The results presented would indicate that this sewing machine test 
affords a valid measure to select good operators who are experienced. 
Because of the present difficulties in obtaining experienced operators, a 
practical question is whether this test can differentiate between inex- 
perienced operators who will be good as compared with those who will 
not be successful. Seventeen applicants were hired by the firm without 
reference to test score. Ten girls were still employed whereas seven girls 
dropped out after an average of twenty-one hours of work (range 16 to 
44 hours). Analysis of the test results indicated that prediction would 
not have been possible. Any one of three explanations is possible:
1—limited sampling; 2—the test is an achievement test and not an 
aptitude test; 3—the better girls do not remain at this particular factory 
because of many reasons peculiar to the factory itself. 

Continuing the use of this test at this and other establishments that 
afford the opportunity of follow-up is suggested to completely answer 
whether this sewing machine test can be regarded as an aptitude measur- 
ing device or is solely an achievement test. 


Summary 


The tracing sub-test of the MacQuarrie discriminates between two 
groups of sewing machine operators to a better extent than any of the 
other sub-tests or the total test. 

The Blum Sewing Machine Test differentiates between the highest
quartile of employees and the lowest quartile. Depending upon the
availability of the labor supply, the critical score to be established
varies between 200 and 260 seconds. The critical score of 200 seconds
is recommended as best. This score applies only to experienced
operators.














The Constancy of the Intelligence Quotient as Indicated by 
Retests of 130 Children * 


Mary Elizabeth Allan and Florene M. Young 
University of Georgia 


The question of the constancy of the I.Q. has occasioned much dis- 
cussion and research in America. Marked differences of opinion may be 
noted in the literature on the subject, and it is one of the important cur- 
rent controversies. It is not feasible to summarize here the findings from 
the numerous writers on the subject. However, a few of the sources con- 
taining material on the problem are cited in comparing the results of the 
present study with some of the findings of previous investigators. 


Problem and Method 


The present study is an investigation of the constancy of the intelli- 
gence quotients of 130 children, and the relationship of this constancy to 
the length of interval between tests, to the tests used, to sex, to age, and 
to the level of intelligence at the time of the first test. 

Of the 130 cases, 100 received individual examinations both for the 
initial test and the retest. For this group, Group D, the mean age at the 
first test was four years, and at the retest it was approximately nine and 
one-half years. The 30 subjects in Group E received individual tests at 
the mean age of nine years eight months, and after an average of seven 
years three months they were given a group test. 

Initial test records for the 130 cases were obtained from the files of 
the Psychology Department. Of these subjects 94 were tested during 
attendance at the Nursery School of The University of Georgia. The 
other subjects did not attend the Nursery School and were tested at a 
slightly higher age level. 

All examinations were administered by members of the staff of the
Psychology Department, or the School of Home Economics, or by advanced
students trained in clinical testing. Twenty-eight of the earlier
tests of preschool children were given by Dr. Dorothea McCarthy, or by
Mrs. Katherine Shaw Michael, who were associated with the School of
Home Economics as child psychologists on the staff of the Nursery
School.

* The present study is based upon a portion of the master's thesis of
Miss Allan, prepared under the supervision of the senior author, Dr.
Young, at The University of Georgia. The latter has made further checks
upon the data, and prepared the present manuscript for publication. The
authors wish to acknowledge with thanks the valuable suggestions
received from Dr. A. S. Edwards in connection with the various phases
of the study.

Retest as well as initial test records for 23 cases were already available
in the files of the Psychology Department. All examinations for these
cases were given by Dr. A. S. Edwards, or by students trained by him in
testing.

Forty-three of the retests were given by the senior author of this 
study, or by advanced clinical psychology students who were constantly 
under her personal supervision in their testing. The junior author gave 
64 retests. The initial records were gathered over a period of approxi- 
mately eleven years. The retesting program covered a period of six 
months, from October 22, 1940, through April 24, 1941. When the 
additional records which were available from Dr. Edwards are included 
it may be stated that the second tests were given within a period of seven 
years seven months. 

Three tests of individual type were used for the initial tests: The 
Merrill-Palmer (11), the Stanford-Binet, 1916 Revision (12), and the 
Terman-Merrill, 1937 Revision (13). 

For the retests, the Terman-Merrill, 1937 Revision, was employed 
for the 100 individual examinations. Thirty subjects had as retests the 
Psychological Examination of the American Council on Education. Of 
these, 23 took the tests with the other freshmen entering the University 
of Georgia. Seven others who were completing high school work re- 
ceived the test within six weeks of graduation. A total of 30 cases, there- 
fore, received a group test which is, to a large extent, dependent upon 
reading skill. 

The testing of children of preschool or primary level usually was done 
in the special testing room of the Nursery School where the small tables 
and chairs made them more comfortable. The majority of these subjects 
were attending or had attended the school and were familiar with the sur- 
roundings. The older children usually were examined in one of the clinic 
rooms of the Department of Psychology. The 30 subjects receiving the 
A.C.E. examination as the retest took the group test with the other 
freshmen in a large auditorium, or in a classroom with other seniors in 
high school. 

No test or retest record was used if there was evidence that the child 
at the time of the examination was not well. Nor was any record em- 
ployed if it contained a notation that crying, tantrums, or consistent 
negativism occurred during the testing period. As an additional pre- 
caution, the scoring and the calculation of age, mental age, and I.Q., were 
rechecked by the junior author for all records. 
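
The arithmetic that was rechecked can be sketched as follows for the Binet-type scales, where the I.Q. is conventionally the ratio of mental age to chronological age multiplied by 100. This is a minimal sketch: the function names and the "years:months" string format (borrowed from the age notation used in the tables) are assumptions for illustration, and the Merrill-Palmer I.Q.s in this study were instead read from Stutsman's tables.

```python
def age_in_months(age):
    """Convert an age written as 'years:months' (e.g. '4:6') to months."""
    years, months = age.split(":")
    return int(years) * 12 + int(months)


def ratio_iq(mental_age, chronological_age):
    """Ratio I.Q.: 100 * mental age / chronological age, as a whole number."""
    ma = age_in_months(mental_age)
    ca = age_in_months(chronological_age)
    if ca <= 0:
        raise ValueError("chronological age must be positive")
    return round(100 * ma / ca)


# Hypothetical child: mental age 5:0 at chronological age 4:0.
print(ratio_iq("5:0", "4:0"))
```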







For the initial tests, 78 of the 100 subjects who were given individual
tests throughout received the Merrill-Palmer Scale (Table 1). This
scale is characterized by the presence of more timed responses than are
found in the Binet Scales. It also contains more motor tests and tests
of form perception. In comparing the results of the Merrill-Palmer with
those of the Stanford-Binet, Stutsman (11, p. 348) said, “Unless children
are suffering from some special disability or from a language handicap,
their scores on the Stanford-Binet and Merrill-Palmer Scales usually show
a fairly close correspondence.” A group of 115 children at the
Merrill-Palmer School received both series of tests and a correlation of
.783 ± .025 was found (11, pp. 112-116). Also, a group of 159 subjects
between the ages of three and six received both series of tests and a
correlation of .793 ± .019 was obtained.

Table 1
Subjects in Each Group Receiving Specified Tests 1

[Cell counts are illegible in the source. Column groups: Initial Test
and Retest. Legible row labels: A (Interval of 1-2:11); B (Interval of
3-5:11); C (Interval of 6-12); D (Interval of 1-12); E; I.Q.s 115 up;
I.Q.s 93-114.]

1 Abbreviations of tests refer respectively to: Stanford-Binet, Terman-Merrill,
Merrill-Palmer (Stutsman), Terman-Merrill, and Psychological Examination of the
American Council on Education.

Terman and Merrill (13, pp. 50-51) have stated that the new scales
measure almost the same functions as were measured by the original
Stanford-Binet. Verification of this statement is found in the correlation
obtained when both scales were administered to the same subjects. For
123 children ranging in age from five to twelve years and retested after a
five-month interval, the Stanford-Binet (1916 Revision) and the
composite L-M of the 1937 Revision showed a correlation of .877.
Another sample of 120 cases showed a correlation of .925 when the L scale
of the 1937 Revision was administered to a group of subjects who had
been tested previously with the Stanford-Binet.

For purposes of comparison the subjects were divided into three
groups, A, B, and C, according to the length of intervals between tests.
Group A, for instance, is composed of children retested after one year
had elapsed but before three years had elapsed. Group D is
composed of the 100 subjects receiving individual tests throughout.

Group E is made up of 30 subjects whose initial test was the Stanford- 
Binet (1916 Revision) and whose retest was the A.C.E., a group test. 

The subjects also were grouped according to the level of intelligence 
on the first test, 51 cases having I.Q.s of 93-114, and 49 having I.Q.s of 
115 or higher. 

In Table 1 likewise may be found the number of boys and girls taking
the designated tests, the number for whom the Stutsman (Merrill-Palmer)
and the Binet Scales were used, and the subjects for whom the Binet
Scales were employed for both initial and final tests.

Some investigators (2, pp. 164-165) have reported that existing tests, 
including the Merrill-Palmer Scale, yield unsatisfactory results when 
administered to children of less than 30 months of age. Others have 
stated that I.Q.s obtained for the higher age levels are open to question. 
Consequently, 42 children were tested and retested between the ages of 
three and twelve years. Presumably, the results for this group would be 
open to less criticism for the reasons just stated. 

All subjects included in this investigation came from relatively for- 
tunate homes. A large proportion came from homes belonging in Group 
I, Professional, or Group II, Semi-professional and managerial. A smaller 
percentage came from Group III, Clerical, skilled trades, and retail 
business. This classification of occupations is from that of Goodenough 
and Anderson (6, p. 237). 


Results 


The results of the present experiment are given in Tables 2-6, in- 
clusive, for the several groups described in Table 1. The reliability of 
the difference between means of the first and second tests for specific 
groups may be found in Table 2. All data in this table refer to the 100 
subjects receiving individual tests throughout. The critical ratio tech- 
nique which has been used is explained by Cooke (3, pp. 99-106). The 
correlations involved in the formula are given in Table 6. 
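
As a hedged illustration of a critical ratio for two means obtained from the same subjects, the sketch below uses the standard formula for the standard error of a difference between correlated means: the square root of SE1² + SE2² − 2·r·SE1·SE2. It is assumed, not confirmed by the text, that this is the form Cooke describes; the function name and all input values are hypothetical.

```python
import math


def critical_ratio(mean1, mean2, sd1, sd2, r, n):
    """Difference between two correlated means divided by its standard error."""
    se1 = sd1 / math.sqrt(n)  # standard error of the first mean
    se2 = sd2 / math.sqrt(n)  # standard error of the second mean
    se_diff = math.sqrt(se1 ** 2 + se2 ** 2 - 2 * r * se1 * se2)
    return (mean2 - mean1) / se_diff


# Hypothetical group: 100 children, means 110 and 118, S.D.s of 13 and 14,
# test-retest correlation .42 (values chosen for illustration only).
print(round(critical_ratio(110, 118, 13, 14, 0.42, 100), 2))
```

A higher test-retest correlation shrinks the standard error of the difference, so the same mean gain yields a larger critical ratio.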

Group A, composed of 26 children retested after a period of less than
three years but after an interval of at least one year, shows a mean I.Q.
change of only three points. This difference is not statistically
significant, the critical ratio of 1.10 indicating approximately 86
chances in 100 that the difference is a true one. It should be noted that
the mean age at the initial test was three years eight months, and at the
retest the mean age was five years six months. This finding agrees with
that of Hildreth (8, pp. 373-374), who has reported that unless the
length of the interval between tests is greater than three years there is
little variation in test results. However, Rugg (10, pp. 341-343), and
Gray and Marsden (7, pp. 20-26) found that the time interval had no
effect upon the stability of the I.Q.

Table 2
Reliability of the Difference Between Means of First and Second Tests for
Specific Groups Receiving Individual Tests

[Table body illegible in the source. Columns: Group; No.; Interval in
yrs. and mos.; First Test and Second Test (mean age, mean I.Q., S.D.);
Diff.; S.E. Diff.; Critical Ratio; Chances in 100.]

Grouping refers to the interval as stated in the interval columns, for
Groups A, B, C, and D.
Correlations involved in the formula may be found in Table 6.
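
The "chances in 100" quoted with each critical ratio appears to correspond to the area of the normal curve lying below the ratio. The sketch below rests on that assumption (the article does not state which table was used) and reproduces the 86 chances in 100 reported for Group A's ratio of 1.10; figures for larger ratios may differ in the last digit from those printed in the article. The function name is hypothetical.

```python
import math


def chances_in_100(critical_ratio):
    """Area of the standard normal curve below the critical ratio, times 100."""
    return 100 * 0.5 * (1 + math.erf(critical_ratio / math.sqrt(2)))


# Group A: critical ratio of 1.10.
print(round(chances_in_100(1.10)))
```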

Group B, composed of 36 children retested after three to six years, 
showed an increase of nine I.Q. points. The critical ratio of 4.55 indi- 
cates that the difference is statistically significant. 

Thirty-eight children, retested after six to twelve years, are included 
in Group C. These subjects averaged twelve years eleven months of age 
at the time of the second test, which was given after a mean interval of 
eight years ten months. The mean change, a gain of eleven I.Q. points, 
is a statistically significant one, the critical ratio being 4.17. 

The group of 100 children (Group D) receiving individual tests 
throughout, showed a mean gain of eight I.Q. points on the retest. The 
critical ratio of 5.41 indicates that the difference is statistically significant. 
A mean interval of five years six months elapsed between initial and 
second tests, the respective mean ages being four years, and nine years 
five months. 

When the 49 children whose initial I.Q.s were 115 or higher are
compared with the 51 subjects whose initial I.Q.s were 93 to 114,
inclusive, it is found that the latter, which may be regarded as
approximately a normal group, showed much greater change than did the
superior group. The superior group gained an average of three I.Q.
points over a period of five years seven months, and this change is not
statistically significant, as indicated by a critical ratio of 1.59. The
normal group gained an average of 14 I.Q. points, after a mean interval
of five years four months, and this difference is highly significant,
the critical ratio being 7.41.

From these data it would appear that the normal cases, I.Q.s 93-114 
on initial tests, were not demonstrating their full ability on the first tests, 
which were given at a mean age of three years seven months. At that 
time the I.Q. average was 106, whereas it should have been 120, according 
to the later tests. It seems possible that subjects as immature as were 
these children at the initial testing cannot be depended upon to put forth 
their optimum effort, hence the results may not be indicative of their true 
ability in all instances. 

An interesting observation related to the points just noted is the fact
that it is boys rather than girls who accounted for most of the change in
I.Q. On the first test 31 boys and only 20 girls came in the group having
I.Q.s of 93-114. On the initial test boys had a mean I.Q. of 113 whereas
that of girls was 120, the average age of the subjects being four years one
month, and three years nine months, respectively. On the retest, boys
and girls have almost the same mean I.Q.s, the respective records being
124 and 126. The gain of eleven points for boys is a statistically
significant difference, the critical ratio being 5.42. The gain for girls
is six I.Q. points, and the critical ratio of 2.69 indicates 99.5 chances
in 100 that the difference is a true one. In this connection it may be
noted that Foran (5, pp. 9-12) regards a variation of five points or less
as evidence of I.Q. constancy. Gray and Marsden (7, pp. 20-26) compared
retests involving 109 children and found a probable error of estimate of
5.5 I.Q. points.

It is important to observe that boys are less likely to reveal their real 
ability at the age of four years one month than are girls of similar age. 
Whether this is due to a slower rate of maturing, greater timidity on the 
part of preschool boys, or to other factors, is not known. It is possible 
that girls were superior to boys in verbal responses on the preschool tests. 
Davis (4, p. 49) obtained a superiority in sentence length for girls as 
compared with boys at age five and one-half years, and also at later ages. 
When Young (14, p. 68) compared boys and girls it was found that pre- 
school girls talked more than preschool boys, the difference being statis- 
tically significant. 

In Table 2 may be found the results of 78 children tested with the 
Merrill-Palmer Scale (Stutsman) and retested with the Terman-Merrill 
Revision (1937) of the Stanford-Binet. The change of scores into I.Q.s 
was done with the tables provided by Stutsman (11) in her manual. The 
group included subjects whose mean chronological age at the time of the 
first test was three years five months, and whose mean age at the time 
of the retest was eight years nine months. The I.Q. on the second test 
was nine points higher and the critical ratio of 5.92 indicates that the 
difference between the two means is a true one. 

Results also are given in Table 2 for 16 subjects tested with the Stan- 
ford-Binet (1916 Revision) and 6 tested with the Terman-Merrill Scale 
(1937 Revision). The 22 children included in this group show a mean 
gain of four I.Q. points on the retest with the Terman-Merrill Scale. 
The critical ratio of 1.27 indicates that this difference is not statistically 
significant. 

The data obtained in this investigation seem to indicate that the I.Q. 
is constant when Binet tests are used for the initial test and also for the 
retest. However, when the I.Q.s from the Merrill-Palmer Scale are used 
for the first test the difference between it and the retest is statistically 
significant. The transmuting of the Stutsman (or Merrill-Palmer) scores 
into I.Q.s may have been a factor affecting the results. 

Table 2 shows that the 42 children who were tested and retested
between the ages of three and twelve years gained six I.Q. points, the
critical ratio of 2.96 indicating 99.9 chances in 100 that there is a true
difference between the test and retest means. The average age at the
initial test was four years, and at the retest it was seven years nine
months. When these subjects are compared with the group of 100 subjects
(Group D) it will be noted that the latter gained only two points more
than that shown by the group tested within the age limits 3-12 years. The
restriction to the ages where the tests are less open to criticism results
in a slightly smaller difference between test and retest, but a difference
which is, nevertheless, statistically significant.

Table 3
Number and Per Cent in Specific Groups Showing Gain, Loss, No Change, and
Total Change in I.Q. Points
(Based upon Cases Receiving Individual Tests)

[Table body illegible in the source. Legible column headings: Group; No.;
Mean Interval; No Change (No., %); Total Change.]

The number and per cent of subjects showing gain, loss, no change, 
and total change, may be found in Table 3. The total change is obtained 
by adding all the changes in I.Q. regardless of algebraic signs, and dividing 
by the number of subjects showing the changes. 
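
The quantities tabulated in Table 3 can be computed as in the following sketch, which follows the definition just given: the total change is the sum of the changes without regard to sign, divided by the number of subjects who changed. The function name and the four pairs of I.Q.s are hypothetical and serve only to show the arithmetic.

```python
def summarize_changes(first_iqs, second_iqs):
    """Per cent gaining, losing, and unchanged, plus the mean total change."""
    changes = [second - first for first, second in zip(first_iqs, second_iqs)]
    n = len(changes)
    gains = [c for c in changes if c > 0]
    losses = [-c for c in changes if c < 0]  # stored as positive magnitudes
    changed = gains + losses
    return {
        "pct_gain": 100.0 * len(gains) / n,
        "pct_loss": 100.0 * len(losses) / n,
        "pct_no_change": 100.0 * (n - len(changed)) / n,
        # Sum of changes regardless of sign, over subjects who changed.
        "mean_total_change": sum(changed) / len(changed) if changed else 0.0,
    }


# Hypothetical first and second I.Q.s for four children.
first = [100, 110, 120, 95]
second = [110, 105, 120, 100]
print(summarize_changes(first, second))
```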











Table 4
Percentage of Subjects in Designated Groups with Specified Changes 1 in I.Q. Points

                  1-5 Points    6-10 Points   0-10 Points   11-20 Points   21+ Points    11+ Points
Group               +     -       +     -       Change        +     -       +     -       Change
A                 11.5  23.1    15.4   3.9       53.9       19.2   7.7    11.5   7.7       46.1
B                 19.4  16.7    16.7   5.6       61.2       22.2   0      16.7   0         38.9
C                  7.9  10.5    15.8   5.3       42.1       23.7   5.3    26.3   2.6       57.9
D                 13.0  16.0    16.0   5.0       52.0       22.0   4.0    19.0   3.0       48.0
I.Q. 115+          8.2  22.5    30.6   8.2       71.5        8.2   6.1     8.2   6.1       28.6
I.Q. 93-114       17.7   9.8     2.0   2.0       33.5       35.3   2.0    29.4   0         66.7
Boys              13.7  13.7    21.6   2.0       54.9       19.6   0      23.5   2.0       45.1
Girls             12.2  18.4    10.2   8.2       49.0       24.5   8.2    14.3   4.1       51.1
Stutsman-Binet    14.1  15.4    16.7   3.9       52.7       21.8   2.6    20.5   2.6       47.5
Binet-Binet        9.1  18.2    18.2   9.1       54.6       18.2   9.1    13.6   4.6       45.5
Ages 3-12         14.3  19.1    11.9   4.8       52.5       23.8   4.8    14.3   4.8       47.7

1 Gains are indicated by plus signs, and losses by minus signs.


In Table 4 may be found the percentage of subjects in designated 
groups with specified changes in I.Q. points. All data in Tables 3 and 
4 are based upon the 100 subjects receiving individual tests throughout. 

In considering the relation of the length of time between tests to the
constancy of the I.Q. it may be noted that the largest mean gain is to be
found in the record of Group C, the group with the longest interval, eight
years ten months, between tests (Table 3). For 98 per cent of the 100
cases some change in I.Q. was noted, the mean total change being 12.8
I.Q. points (Table 3).

The greatest loss was evidenced by the group retested after the shortest
period, that is, after a mean interval of one year ten months. In this
group, Group A, 42.3 per cent of the 26 subjects lost an average of 9.4 I.Q.
points (Table 3). However, more members of this group gained than
lost, 57.7 per cent registering an increase in I.Q. Of the 100 subjects
receiving individual tests throughout, 28 per cent had lower I.Q.s on the
retest, the mean loss for these cases being 7.5 I.Q. points (Table 3).

It should be noted that two and one-half times as many subjects 
gained as lost in I.Q. points. Moreover, the gain in I.Q. points was 
approximately twice as large as the loss (Table 3). 

The per cent of each group whose I.Q. changed eleven or more points 
is as follows (Table 4): Group A, 46.1 per cent; Group B, 38.9 per cent; 
Group C, 57.9 per cent; Group D, which is composed of the 100 subjects 
having individual tests throughout, 48 per cent. 

If the constant I.Q. is defined as one which changes ten points or less, 
52 per cent of the 100 subjects in Group D showed constant I.Q.s (Table 
4). A gain of eleven to twenty points was shown by 22 per cent, while 
19 per cent increased more than 20 I.Q. points. In accordance with 
Table 3, the gains in Table 4 are far greater than the losses. This is 
contrary to the findings of Brown (1, pp. 86-90), who noted a tendency 
for loss in I.Q. points in a study of 124 children. Of his subjects, 53 lost 
more than 16 I.Q. points, and the number of points lost was more than 
twice the number gained. 
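
The classification behind Table 4, together with the working definition of a constant I.Q. as one changing ten points or less, can be sketched as below. The function names are hypothetical, and the sketch assumes changes are expressed in whole I.Q. points.

```python
def change_band(change):
    """Return (sign, band) for a whole-number I.Q. change, as grouped in Table 4."""
    size = abs(change)
    if size == 0:
        return ("none", "0")
    sign = "+" if change > 0 else "-"
    if size <= 5:
        band = "1-5"
    elif size <= 10:
        band = "6-10"
    elif size <= 20:
        band = "11-20"
    else:
        band = "21+"
    return (sign, band)


def is_constant(change, limit=10):
    """A 'constant' I.Q. here is one that changes by `limit` points or less."""
    return abs(change) <= limit


# Hypothetical changes between first and second tests.
for change in (14, -3, 0, 25):
    print(change, change_band(change), is_constant(change))
```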

The I.Q. changes of normal and superior children are given in Table 3.
Of the normal subjects, those whose I.Q.s ranged from 93 to 114, 98 per
cent showed a fluctuation in I.Q., the mean total change amounting to
15.3 I.Q. points. Of the brighter subjects, 98 per cent likewise showed a
change in I.Q., but the total change of 10.1 I.Q. points was 5.2 points
less than that of the normal group. Far more of the normal cases gained
than did the bright subjects, the respective proportions being 84.3 and
55.1 per cent. About three times as many bright as normal subjects lost
I.Q. points, 42.9 per cent of the superior group having a decrease in I.Q.
This decrease amounted to a mean of 8.4 I.Q. points.

From Table 4 it may also be seen that the normal group was far more
variable than the superior group. Of the group having I.Q.s of 93-114
about two-thirds changed eleven I.Q. points or more, but only 28.6
per cent of the brighter subjects showed such a marked fluctuation.
These facts corroborate those noted in connection with Tables 2 and 3
in showing that the I.Q. is more constant for the brighter group than for
the normal group.

For both boys and girls a change in I.Q. was found to be characteristic. 
The respective proportions having some degree of change were 96.1 per 
cent and 100 per cent (Table 3). The number of boys gaining was mark- 
edly greater than the number of girls, as shown by the respective propor- 
tions, 78.4 and 61.2 per cent. Correspondingly, more than twice as many 
girls as boys lost I.Q. points, the proportions being 38.8 per cent and 17.7 
per cent (Table 3). 













From Table 4 it may be noted that 51 per cent of girls changed eleven
points or more, while the record for boys is 45 per cent. On this basis
the I.Q. change of the latter was somewhat less than that of girls.

The changes for subjects tested with the Merrill-Palmer (Stutsman) 
Scale and retested with the Terman-Merrill are given in Table 3. Of the 
78 subjects, 97.4 per cent changed in I.Q., the mean total change being 
13 I.Q. points. It should be observed that this amount is almost identical
with that for subjects tested and retested with the Binet Scales. For 
those receiving the Merrill-Palmer Scale the tendency was for the retest 
to register a gain. An increase was recorded for 73.1 per cent, the mean 
gain amounting to 15.2 I.Q. points. As shown in Table 4 nearly half 
the subjects, 47.5 per cent, changed eleven or more I.Q. points. For 
those tested and retested with the Binet Scales the proportion, 45.5 per 
cent, was nearly as large (Table 4). As with the previous group, the 
tendency was for the retest to show a gain, this being the case in 59.1 
per cent of the cases (Table 3). 

As mentioned previously, 42 cases were tested and retested between 
the ages of 3-12 to avoid the criticisms which arise in connection with 
other ages. In Table 3 it will be observed that 97.6 per cent of these 
subjects showed a change in I.Q., and the mean total change averaged 
11.4 I.Q. points. Most of these changes were gains, 64.3 per cent having
larger I.Q.s on the retest. It will be recalled that the percentage with
changes in I.Q. was 98 per cent when all ages and intervals (for 100 sub- 
jects) were included. The proportions in the two groups changing eleven 
or more I.Q. points were 48 per cent in both instances (Table 4). 

The percentage of cases showing I.Q. variations of ten points or less 
ranged from 33.5 to 71.5 for the several groups (Table 4). The propor- 
tions changing eleven points or more ranged from 28.6 to 66.7. The 
group with the least fluctuation, as recorded in Table 4, was composed of 
subjects having I.Q.s of 115 and higher. The group with the greatest 
change was made up of cases with I.Q.s of 93-114. Apparently, the 
failure of these preschool subjects, of an average age of four years, to 
demonstrate their superior ability on the initial intelligence tests was an 
important factor in producing a relative lack of constancy in the
intelligence quotient.

The results of Group E are given in Table 5. Thirty children were
first tested with the Stanford-Binet (1916 Revision) and retested with
the Psychological Examination of the American Council on Education,
after a mean interval of seven years three months. The mean I.Q. for
the initial test was 119, and the mean P.R. for the A.C.E. was 44.
National norms were used in obtaining the percentile ranks. Since there is
no acceptable method of transmuting the P.R.s into I.Q.s comparable
to those used in the Binet tests, it is difficult to make comparisons. A
mean I.Q. of 119 means that the group was superior in comparison with
relatively unselected subjects of comparable ages. Probably most of the
subjects would fall in the best ten per cent of cases of these age levels. A
percentile rank of 44 would be interpreted to mean that the group was
slightly below the national average, which is a P.R. of 50, when compared
with the rather highly selected college students whose records compose
the national norms of the A.C.E. It should also be recalled that much
of the material in this test involves reading ability, whereas the
Stanford-Binet contains very few tests involving reading. The mean age at
the first test, nine years eight months, probably is an important factor
also, since the problems arising from the immaturity of preschool children
would not be present for this group.

Table 5
Test-Retest Results for Group E

          Interval              First Test                    Second Test
No.    Range       Mean    Mean Age  Mean I.Q.  S.D.     Mean Age  Mean P.R.  S.D.
30     4:10-12:1   7:3     9:8       119        19.40    -         44         30.80

The correlations (Pearson product-moment) between the initial tests
and the second tests are given in Table 6. They range from .22 ± .090
(I.Q.s of 93-114) to .59 ± .080 (Group E, Binet-A.C.E.). The
correlations are low when compared with those of several studies of this
type. The use of relatively homogeneous groups may have depressed the
correlations somewhat (9, pp. 199-200). Moreover, the initial records of
the 100 subjects having individual tests were obtained at a mean age of
four years. As previously stated, it is possible that some of the young
subjects did not put forth their best efforts, and the correlations may
reflect the influence of this factor.

Table 6
Correlations between Tests and Retests for Specific Groups

Group                             No.    Mean Interval    Correlation
A (Interval of 1-2:11)             26        1:10         .43 ± .110
B (Interval of 3-5:11)             36        4:6          .46 ± .088
C (Interval of 6-12)               38        8:10         .48 ± .084
D (Interval of 1-12)              100        5:6          .42 ± .056
E                                  30        7:3          .59 ± .080
I.Q.s 115 up                       49        5:7          .32 ± .087
I.Q.s 93-114                       51        5:4          .22 ± .090
Boys                               51        5:8          .32 ± .086
Girls                              49        5:3          .41 ± .080
Stutsman I.Q.s-Binet I.Q.s         78        5:5          .46 ± .060
Stutsman P.R.s-Binet I.Q.s         78        5:5          .31 ± .069
Binet I.Q.s-Binet I.Q.s            22        5:8          .37 ± .123
Tested between ages 3 and 12       42        3:9          .44 ± .084
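
The ± values attached to the correlations in Table 6 are consistent with the classical probable error of a correlation coefficient, 0.6745 (1 − r²) / √N. The article does not name the formula, so this reading is an assumption; the sketch below, with a hypothetical function name, reproduces the tabled .056 for Group D and .080 for Group E.

```python
import math


def probable_error_r(r, n):
    """Classical probable error of a Pearson correlation r based on n cases."""
    return 0.6745 * (1 - r * r) / math.sqrt(n)


# Group D in Table 6: r = .42, N = 100. Group E: r = .59, N = 30.
print(round(probable_error_r(0.42, 100), 3))
print(round(probable_error_r(0.59, 30), 3))
```

On this reading, a correlation of .42 ± .056 is about seven and one-half times its probable error.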

The second and third highest correlations, .48 ± .084 and .46 ± .088,
are for Groups C and B, with intervals of 6-12 years, and 3 to 6 years, 
respectively. It will be recalled that statistically significant differences 
between initial and second tests were obtained (Table 2) for these groups. 

Stutsman I.Q.s and Binet I.Q.s likewise have a correlation of .46. 
The Stutsman P.R.s and Binet I.Q.s show a correlation of only .31 ± .069
(Table 6). As previously stated, Stutsman (11) provides tables for trans- 
muting scores into I.Q.s. The correlation for the former group, Stutsman 
I.Q.s—Binet I.Q.s, is nine points higher than that obtained for test-retest 
results which employed Binet tests in both series (Table 6). It should 
be recalled, however, that the difference between the means for the 
Stutsman I.Q.s—Binet I.Q.s proved to be more than twice as large as 
the difference between the means of the Binet I.Q.s—Binet I.Q.s (Table 2). 

For the 26 members of Group A, with an interval of more than one 
but less than three years, the correlation between the first and second 
tests is .43 ± .110 (Table 6). This is of particular interest since the
critical ratio of 1.10 indicates that the difference is not statistically sig- 
nificant (Table 2). 

For Group D, which is composed of 100 subjects receiving individual 
tests throughout, the correlation between the initial and second tests is 
.42 ± .056 (Table 6). The critical ratio of 5.41 means that there are
100 chances in 100 that a true difference between test results exists 
(Table 2). 

The correlation for I.Q.s 115 or higher, although only .32 ± .087, is
ten points higher than that for subjects having I.Q.s of 93-114 (Table 6).
It will be recalled that the differences between test and retest means for 
these groups proved to be respectively insignificant and highly significant 
(Table 2). 

The correlation of the test results of girls is .41 ± .080, which is nine
points higher than that of boys (Table 6). For both groups the differ- 
ences between the initial and second tests proved to be reliable ones 
(Table 2), but the difference for boys is nearly twice as great as that 
for girls. 

Correlations for the 42 subjects tested and retested between the ages 
of 3-12, and for the 100 subjects receiving individual tests throughout 
(Group D), are nearly the same, .44 and .42, respectively (Table 6). 
As shown in Table 2 the differences between the first and second tests 
are significant for both series. 













Case Studies 


Inasmuch as large fluctuations in I.Q. have been reported in this 
study a few case studies are presented. These may reveal some factors 
affecting the constancy of the intelligence quotient. From these and 
other case studies it appears that young children may be noticeably in- 
fluenced by the insecurity which comes from loss of a parent or from 
frequent change of home, by the type of treatment which engenders lack 
of self-confidence and prolongs immaturity, and by lack of educational 
opportunities. It also seems probable that some subjects born prema- 
turely may be influenced by this fact, especially during the early years. 
The writers do not assert that these factors are causative, but they are 
the only possible explanations obtainable in several instances. 

Case no. 30. The fact that the birth of this boy was three months 
premature may have affected his early development. At the time of the 
initial test, when he was four and one-half years of age, the Merrill- 
Palmer Scale was used and the examiner reported an erratic performance. 
At that time the subject insisted on counting aloud and asking questions, 
types of behavior which penalized him on some of the timed items. The 
child was also distractible and lacked self-reliance. The gain of 45 I.Q. 
points on the retest, the Terman-Merrill Revision, was recorded eight 
years nine months after the initial test. In the interim the boy has had 
excellent opportunities at home, at school and in group organizations. 
His school work has been consistently superior, and he now shows com- 
mendable independence and self-confidence. 

Case no. 3. This boy was first tested at the age of three years nine 
months with the Merrill-Palmer Scale, a few weeks after entering Nursery 
School. He had been adopted at an early age by a superior couple. 
The death of his foster father made it necessary for the foster mother 
to work outside the home, and the emotional shock, the adjustment to a 
new environment, as well as other unknown factors, may have influenced 
the record on the initial test. The child showed lack of concentration 
and a marked desire to attract attention. After two years five months 
the Terman-Merrill Revision was given and there was an increase of 23 
I.Q. points over the earlier record. The foster mother reported that she 
had made an effort for several months to check the child’s development 
by use of questions prepared for various ages in a nationally distributed 
magazine. This type of exercise may have served as “unconscious 
coaching.” The subject has made a splendid adjustment to his environment 
and is doing excellent work in a private school in the first grade. 

Case no. 6. In this case the gain of 38 I.Q. points over an interval of 
three years four months is probably due, to some extent, to a change in 
the attitude of the parents toward their only child. When the subject, 
a boy of 32 months of age, entered the Nursery School his parents treated 
him as though he were a baby. The Merrill-Palmer Scale was adminis- 
tered a few weeks after his entrance to the school and he demonstrated 
lack of self-reliance, distractibility, and other types of reactions which 
often characterize immaturity. Throughout the period between tests the 
child was encouraged to develop self-confidence, and to master the skills 
typical of his age. His parents cooperated whole-heartedly and they 
have made special efforts to give the boy numerous opportunities to 
learn. Seemingly, these factors may have influenced the excellent results 
obtained on the Terman-Merrill Revision, which was used for the retest. 

Case no. 10. This subject, a girl of five years at the time of the 
initial test, the Terman-Merrill (L), showed marked self-assurance and 
poise. At the time of the first test she was in Nursery School, but when 
retested a year later she was not in school, although she was of school age. 
This may have been a factor of some importance, since several items which 
she failed probably would have been influenced by school attendance. 
During the intervening year there had been much illness in the child’s 
home and this fact may have reduced the time available for giving the 
subject educational opportunities. At the time of the retest, when the 
Terman-Merrill (M) was administered, the child seemed to be self- 
conscious and lacking in confidence. She was aware of many of her 
failures and her mother reported that she cried when she reached home, 
saying that she knew she did not do well on the things she was asked to do. 
Although this subject showed a loss of 22 points the retest I.Q. of 123 
would still indicate superior ability. 

Case no. 95. This subject, a girl of two years seven months, was given 
the Merrill-Palmer Scale a few weeks after she entered Nursery School. 
At this time the child lacked a permanent residence. Her father had 
died and the mother had obtained remunerative employment. The 
little girl was cared for by relatives and taken from one home to the other. 
It is possible that the results of the disturbance and the feeling of inse- 
curity were among the factors influencing the relatively low score on the 
first test. During the interval between tests the mother remarried and 
the child was permanently located with her maternal grandparents. It 
is possible that the greater security represented by a definite home and 
the better adjustment which was evidenced by the child may have in- 
fluenced the retest record. On the second test, the Terman-Merrill 
Revision, administered at the age of six years five months, there was a 
gain of 28 I.Q. points. 

Case no. 12. This boy was first tested at the age of eight years two 
months at the request of his physician, who desired to know his mental 
rating. For several years the boy had been subject to frequent, severe 
headaches. On the Stanford-Binet (1916 Revision) his I.Q. was 110. 
At the time of the test he was free of headache and he asserted that he was 
feeling well. Clinical notes made at the time of the first test indicate 
that the mother was in a serious toxic condition during the later months 
of pregnancy. Also, birth conditions were abnormal, and the mother 
believed that there was a likelihood of some birth injury. From infancy 
the child was sickly and under constant medical care. He lost consider- 
able time from school because of poor health. After the initial test the 
child was under the care of a physician until his health became normal. 

The subject was sixteen years old when retested with the Terman- 
Merrill (1937 Revision). Seemingly, his physical condition was good, 
with no evidence of the former disabilities. The retest showed a gain 
of 22 I.Q. points. This gain may reflect improved health, which made 
possible regular school attendance. The boy volunteered the statement, 
“I feel like studying and working since I have gotten well.” 

Case no. 92. A loss of 22 I.Q. points was recorded for a little girl who 
was tested at the age of three years three months and retested at five 
years eight months. The Merrill-Palmer Scale was used for the initial 
testing and the Terman-Merrill for the second testing and this may 
account for some of the change. It seems possible, however, that the 
maladjustment arising from jealousy of an infant brother may have had 
some influence. Until the birth of the sibling the subject was the center 
of attention, and numerous relatives showered her with affection and 
gifts. The arrival of a competitor brought about a distinct personality 
change. The subject showed sullenness, negativism, and timidity. Her 
mother stated that her play tended to be more solitary and quiet in type. 
The second test was given during the period when the maladjustment 
was an acute problem in the home. In the test situation the child seemed 
to be reasonably cooperative, but somewhat cool and aloof. It should 
be stated that after much effort, the parents have succeeded in improving 
the adjustment of the subject. 

Case no. 90. A child who was first tested at seven years and retested 
at nine years three months had respective I.Q.s of 112 and 132. The 
subject had two younger siblings and the mother reported that she was 
unable to give the children the best educational guidance during the 
preschool years of the oldest child because she was too busy. At the 
time when the subject entered the first grade, the baby was put in the 
Nursery School. The mother, aided by several advisors, then planned a 
stimulating, enriched home environment for her children, and whole- 
heartedly set about providing for their growth. Any successes were made 
occasions for family rejoicing. Both parents attended any public 
performances in which the subject participated. The mother described her 
little daughter’s development as being “plainly visible even to one who 
doesn’t know about mental tests.” 

Case no. 43. This adopted child was first tested at the age of three 
years nine months with the Merrill-Palmer Scale, and an I.Q. of 126 was 
recorded. One year later the Terman-Merrill was used and the I.Q. was 
111. A difference in tests may account for part of the change in I.Q., 
but it seems possible that emotional factors may have been contributory. 
The child’s home was the scene of frequent and bitter conflicts between 
the adults, and it is believed that these disturbed her more than the 
parents realized. Stormy scenes often occurred in her presence. Lack 
of security and the existence of constant tension may have distracted 
the little girl from wholesome educational activities, and thereby in- 
fluenced achievement. 

Case no. 55. This boy was first tested at the age of two and one-half 
years, after he had been at the Nursery School for several months. When 
he was retested three years five months later his I.Q. was 115, fifteen 
points higher than on the first test. Part of the change may be attributed 
to the difference in the tests, the Merrill-Palmer and the Terman-Merrill, 
which were used. However, some of the difference may be due to an im- 
proved adjustment on the part of the boy. In infancy he had become 
attached to a large stuffed dog which he insisted on keeping with him at 
all times. He brought the toy to the Nursery School and it was gradually 
removed and his attention directed to other toys. At home the fixation 
was allowed to persist for a much longer period, and after an absence from 
school a relapse usually occurred. It is possible that development was 
somewhat retarded by the fact that much of the child’s time and energy 
were consumed with one toy instead of being utilized in a balanced whole- 
some program. 

Case no. 8. The initial test record of this subject, a girl aged two 
years three months, indicates that cooperation was spasmodic. The 
child frequently said to the examiner “You do it,” and showed other 
evidences of lack of self-reliance. Because of the prolonged illness of 
the child’s father the mother had secured remunerative work. The care 
of the child had been left to an unskilled servant, before she was enrolled 
in the Nursery School. A definite change in behavior was observed at 
school, where there was insistence upon self-help and many educational 
opportunities. The child remained in the Nursery School until she was 
five years of age. A retest was made at the age of ten years nine months 
with the Terman-Merrill (1937) and the I.Q. was 44 points higher than 
on the initial test, on which the I.Q. was 115. Part of the difference may 
be attributed to the difference in tests, the Merrill-Palmer having been 
used for the first testing. It is apparent, however, that opportunities 
for the several phases of development were relatively inferior prior to the 
first test. By contrast, both home and school environments were superior 
during most of the interval between tests. 


Summary and Conclusions 


The present study of the constancy of the I.Q. is based upon 130 subjects 
who were retested within one to twelve years after the initial test. 
Individual initial examinations were given to the 100 members of Group D 
at the mean age of four years, and individual retests were administered 
to the same subjects at an average age of nine years five months. Indi- 
vidual initial tests were given to the 30 members of Group E, but these 
subjects received the Psychological Examination of the American Council 
on Education for the retest, after a mean interval of seven years three 
months. 

1. On the basis of the critical ratio technique the results of the study 
of Group D indicate that the I.Q. is relatively constant, that is, the dif- 
ferences between the initial and second tests are not statistically signifi- 
cant, under the following conditions: 


(a) When the subjects are retested after 12 to 36 months. 

(b) When the subjects have initial I.Q.s of 115 or higher. 

(c) When the subjects receive Binet tests, either the Stanford Re- 
vision of 1916 or the Terman-Merrill Revision of 1937, both for the initial 
and final tests. 


2. If the I.Q. which changes six points or less is regarded as constant, 
the following sub-groups of Group D may be regarded as having constant 
I.Q.s. 


(a) Group A, for whom the interval between tests is 12 to 36 months. 

(b) Subjects having I.Q.s of 115 or higher on the initial test. 

(c) Girls. 

(d) Subjects receiving Binet tests both for the initial and second 
examination. 

(e) Subjects tested and retested between the ages of three to twelve 
years, at which levels the tests are open to less criticism. 


3. One measure of the constancy of the I.Q. is the percentage of each 
group changing ten or less I.Q. points. According to this criterion the 
groups with the best records are as follows: 


(a) I.Q.’s of 115 or higher on the initial test............ 71.5% 
(b) Group B (interval of 3 to 6 years).................. 61.2% 
(c) [entry illegible in source]......................... 54.9% 
(d) Binet test—Binet retest............................ 54.6% 










4. Of the 100 subjects receiving individual tests throughout, Group D, 
98 per cent showed a change in I.Q. on the retest. The mean total 
change was 12.8 I.Q. points. Groups with more than 74 per cent of their 
members gaining are listed below: 


(a) I.Q.s 93-114 
(b) Boys 
(c) Group B (interval 3 to 6 years) 


In only three groups did more than 40 per cent of the subjects show 
losses, the mean losses ranging from 3.4 to 9.4 I.Q. points. The groups 
were: 


(a) I.Q.’s 115 or higher 
(b) Group A (interval 1 to 3 years) 
(c) Binet test—Binet retest 


5. The correlations (Pearson product-moment) are lower than those 
obtained by several investigators. Groups for whom the correlations 
between the first and second tests are .45 or higher, are listed below: 


(a) Binet test—A.C.E. retest..................... .59 ± .080 
(b) Group C (Interval 6 to 12 years)............... .48 ± .084 
(c) Group B (Interval 3 to 6 years)................ .46 ± .088 
(d) Stutsman I.Q.’s—Binet I.Q.’s.................. .46 ± .060 


6. Seemingly, some of the most marked fluctuations in I.Q. were in- 
fluenced by such factors as insecurity attending the loss of a parent, by 
frequent or recent changes in the home, by conflict in the home, by treat- 
ment which engendered lack of self-confidence and prolonged immaturity, 
by unusual changes in educational opportunities, and by poor health. 

7. The constancy of the I.Q. varied according to the criterion of 
constancy which was proposed. It seems to have been influenced by such 
factors as length of interval between tests, types of tests used, sex, intelli- 
gence level, and age of subjects. 
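
The criteria of constancy named in conclusions 2, 3 and 5 may be made concrete by a short sketch. The function names and the sample I.Q.s are hypothetical and are not data from the study. The probable-error formula, 0.6745(1 - r²)/√N, is the conventional one and is assumed here rather than quoted from the text.

```python
import math


def percent_within(pairs, limit=10):
    """Percentage of subjects whose I.Q. changed by `limit` points or less."""
    changes = [abs(retest - initial) for initial, retest in pairs]
    return 100 * sum(c <= limit for c in changes) / len(changes)


def mean_change(pairs):
    """Mean of retest minus initial I.Q., with the sign retained."""
    return sum(retest - initial for initial, retest in pairs) / len(pairs)


def probable_error_r(r, n):
    """Conventional probable error of a Pearson r (assumed, not quoted)."""
    return 0.6745 * (1 - r ** 2) / math.sqrt(n)


# Hypothetical (initial, retest) I.Q.s, invented for illustration only.
sample = [(110, 118), (125, 121), (98, 112), (132, 130), (104, 127)]

print(percent_within(sample))                 # share changing ten points or less
print(abs(mean_change(sample)) <= 6)          # six-point criterion of conclusion 2
print(round(probable_error_r(0.59, 30), 3))   # Group E: r = .59, N = 30
print(round(probable_error_r(0.42, 100), 3))  # Group D: r = .42, N = 100
```

On these assumptions the last two lines give approximately .080 and .056, which agree with the figures attached to the correlations reported for Group E and Group D.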


References 


1. Brown, R. R. The time interval between test and retest in its relation to the 
constancy of the intelligence quotient. J. educ. Psychol., 1933, 24, 81-95. 

2. Cattell, P. Intelligence of infants and its measurement. Trans. N. Y. Acad. Sci., 
1941, Ser. II, Vol. 3. Pp. 162-171. 

3. Cooke, D. H. Minimum Essentials of Statistics. New York: Macmillan, 1936. 
Pp. xviii + 271. 

4. Davis, E. A. The development of linguistic skill in twins, singletons with siblings, 
and only children from age five to ten years. Univ. Minn. Inst. Child Welf. 
Monog. Ser., 1937, No. 14. Pp. x + 165. 








5. Foran, T. G. The constancy of the intelligence quotient: a review. Cath. Univ. 
Amer. educ. Res. Bull., 1926, Vol. 1, No. 10. Pp. 40. 

6. Goodenough, F. L., and Anderson, J. E. Experimental Child Study. New York: 
Century, 1931. Pp. xii + 546. 

7. Gray, P. L., and Marsden, R. E. The constancy of the intelligence quotient: final 
results. Brit. J. Psychol., 1926, 17, 20-26. 

8. Hildreth, G. Stanford-Binet retests of 441 school children. Ped. Sem., 1926, 33, 
365-386. 

9. Odell, C. W. Statistical Method in Education. New York: D. Appleton-Century, 
1935. Pp. xx + 457. 

10. Rugg, L. S. Retests and the constancy of the I.Q. J. educ. Psychol., 1925, 16, 
341-343. 

11. Stutsman, R. Mental Measurement of Preschool Children. New York: World Book, 
1931. Pp. x + 368. 

12. Terman, L. M. The Measurement of Intelligence. Boston: Houghton Mifflin, 1916. 
Pp. xviii + 362. 

13. Terman, L. M., and Merrill, M. A. Measuring Intelligence: A Guide to the Administration 
of the New Revised Stanford-Binet Tests of Intelligence. Boston: Houghton 
Mifflin, 1937. Pp. 461. 

14. Young, F. M. An analysis of certain variables in a developmental study of language. 
Genet. Psychol. Monogr., 1941, 23, 3-141. 










An Outline for the Systematic and Comprehensive Study 
of a Personality 


Clarence Leuba 
Antioch College 


Most outlines for the study of a personality have been limited to the 
point of view of some one “school” of psychology, or to allegedly measurable 
aspects of personality, or to those aspects pertinent to a particular 
practical purpose, such as vocational guidance. As far as the writer is 
aware, little progress has been made toward the development of an outline 
for the systematic and comprehensive scientific study of a personality 
since that published almost two decades ago in Chapter 5 of Floyd All- 
port’s Social Psychology. 

The purpose of the present outline is to suggest the data required for 
understanding a personality and for reasonably successful prediction and 
control. It uses terms like attitudes, aptitudes, interests and habits, 
which are generally used and reasonably well defined by psychologists, 
and it attempts to avoid the limitations of any one particular approach or 
“school” of psychology. It is based on the most generally accepted 
conception of personality as consisting of everything which distinguishes 
the person from others and makes him unique.¹ This is less inclusive 
than to consider personality as everything that a person is, but more 
inclusive than to consider it as the impression made on others. The 
latter definition leaves out private, non-socially expressed aspects of 
individuality and the former makes personality co-extensive with psy- 
chology. 

The outline includes three approaches to personality: the analytic, 
synthetic and genetic. The first two represent a cross section of the 
personality as it is at any given moment; the third describes its origin, 
development and trends. 

¹ This is similar to Gordon Allport’s definition of personality as “the dynamic 
organization within the individual of those psychophysical systems that determine his 
unique adjustments to his environment.” Personality, a psychological interpretation, 
p. 48. It is also similar to Ross Stagner’s definition: “. . . those responses which 
determine individuality, that quality of being unique and different in the eyes of others.” 
Psychology of Personality, p. 4. 

Some students of personality may consider the third approach as 
superfluous for understanding a personality as it is at present. But 
traits are not static entities; they have developed from certain origins in 
the past and they are progressing in certain directions. To understand 
the liberalism of a certain individual and to predict his behavior it is 
important to know that his liberalism developed from an early radicalism 
and is gradually becoming more conservative. Furthermore, a liberalism 
developed on rational grounds is different from one arising as an 
emotional reaction against dogmatically conservative parents. These are 
but a few examples of the value of the genetic approach for understanding 
the personality in the present. 

To follow first one approach and then another, as indicated in the 
outline, would be tedious and usually impractical. It would arbitrarily 
separate characteristics which actually have lived and grown together. 
Most students of personality will probably prefer to consider at least the 
analytic and synthetic approaches simultaneously; or, at any rate, while 
possibly emphasizing one approach at a time, they will prefer not to ignore 
the others completely. The outline is designed not to be slavishly fol- 
lowed, but to function as a reminder of the aspects of personality which 
need to be covered somewhere in any study that aims at the comprehen- 
siveness required for understanding, predicting and controlling. 

The outline includes those approaches to personality study whose 
efficacy for purposes of prediction was recently investigated by Polansky.² 
It emphasizes them roughly in proportion to their value as indicated in 
that investigation. Polansky concluded that the description of the pat- 
tern of traits as they existed at the moment in the personality plus that of 
the cultural setting, social relations and genetic background were the 
most revealing approaches for purposes of prediction. After these, in 
decreasing order of importance, were the description of episodes, of test 
scores and of the major maladjustment. Episodes are probably most 
effective if they follow upon a careful definition and description of a trait 
in general terms; they serve to illustrate it, and make its meaning more 
explicit and definite. 

² Polansky, N. A. How shall a life history be written? Character & Pers., 1941, 9, 
188-207. 

In the absence of technical devices, such as tests or rating scales, for 
measuring a trait, one can resort to other means for indicating as 
accurately as possible the extent to which an individual possesses a trait. 
There are at least four ways of doing this. The individual may be 
compared to other persons with reference to the trait (informal man-to-man 
rating); he may be likened to some well known person in this particular 
respect; the number of situations in which he exhibits the trait, and the 
situations required to prevent its appearance, may be indicated; or the 
strength of the trait may be compared with that of others in his 
personality.³ For traits which are unique to the individual, and which are, 
therefore, not generally shown by other people, the first two methods are 
not applicable. In most respects, however, a person’s uniqueness resides 
not in possessing traits totally absent in others but in the extent to which 
he possesses common traits and in their particular organization and 
significance in the total personality. 


An Outline for the Systematic and Comprehensive Study of a Personality 


I. Introduction 


1. Sources of information: the person himself, his record, associates, personal observa- 
tions, etc. 

2. General setting, culturally, economically, etc. 

3. Thumbnail sketch of the person; his chief characteristics. 


II. Analytical Approach 
1. Physical characteristics. 
   Physique. 
   Posture, gait, gestures, manners and mannerisms. 
   Muscular coordination, strength and endurance; reaction time. 
   Activity level, energy and vitality. 
   Impulsiveness-inhibition. 
   Tension-relaxation. 
   Sensory sensitivity. 
   Health; resistance to disease. 
   Physical abnormalities, if any. 
2. Intellectual ability (intelligence). 
   Observational (perceptual) capacities. 
   Memory. 
   Facility in the use of symbols, such as words and numbers. 
   Capacity for thinking and imagination; ability to see relationships and to make 
      discriminations. 
3. Basic aptitudes (partly innate and partly acquired). 
   Musical, graphical, manipulative, mechanical, athletic, etc. 
4. Special acquired skills in— 
   Various trades, such as carpentry, bricklaying, printing, etc. 
   The use of musical or other instruments in the arts. 
   Athletics, such as tennis or golf. 
   Various professions, such as law, medicine, teaching, etc. 
   Housekeeping, such as cooking, sewing, ironing, etc. 
   Dealing with people, as in influencing, pleasing, or leading them. 
   Business dealings, as in bargaining, making the most of opportunities, making a 
      profit, etc. 
   Miscellaneous occupations and hobbies. 
5. Knowledge of facts, theories and principles in various aspects of life. 
   Information pertaining to the sciences, languages, social relations, sex, religion, 
      homemaking, business, politics, etc. 


³ Lombardi, M. M. The inter-trait rating technique. Teach. Coll. Contr. Educ., 
1938, no. 760. 


6. Dynamic or motivating factors. 
   (a) Emotional and affective make-up. 
       Excitability and sensitivity; strength and duration of various emotions. 
       Variations from usual emotional level. 
       Number and variety of emotionally stimulating situations; likes and dislikes, 
          aversions, fears, sources of irritation and disgust, etc. 
       Extent and methods of control. 
       Outlets. 
   (b) Sexual characteristics. 
       Strength of sex drive. 
       Methods of satisfaction. 
   (c) Ambitions, purposes and interests both immediate and long run; 
       susceptibility to various incentives, such as rivalry. 
7. Attitudes or habits with reference to— 
   Himself: self-confidence, insight, modesty, self-discipline, cleanliness, neatness, 
      dress, grooming, etc. 
   Sexual matters. 
   Other people, such as relatives, friends, business associates or members of the 
      opposite sex: degree of considerateness, aggressiveness, cooperativeness, 
      friendliness, etc. 
   A duty, job or undertaking of any kind: habits of work, punctuality, personal 
      efficiency, etc. 
   Obstacles, difficulties and frustrations: persistence, flexibility, stubbornness, 
      defense mechanisms, etc. 
   Property: carefulness, responsibility, honesty, etc. 
   Social questions and institutions: radicalism-conservatism; attitudes toward the 
      church or other specific institutions, etc. 
   Reality and the world in general: realism, idealism, optimism, curiosity, etc. 


III. The Synthetic Approach 
1. The relation of traits to specific situations; specificity vs. generalization; consistency 
      within the self and consistency as determined by social standards. 
2. The interrelation of traits and the organization of personality. 
   The organization of traits with reference to motives, interests, purposes, values 
      and life philosophy. 
   Dominant traits. 
   Clusters or patterns of harmonious traits. 
   The functioning of various traits in life adjustments, as in the home, business 
      or school. 
   Conflicts among traits; maladjustments. 
   Compensations. 


IV. Genetic Approach 
1. Family and educational background. 
2. Development of the characteristics listed under II and III. 
3. People and circumstances influential in the development of the personality. 













The JNB Psychograph 


John N. Buck 
Senior Psychologist, Lynchburg State Colony, Virginia 


It has been said that nowhere else on earth is so much good paper 
wasted as in the average clinic or hospital for persons suffering with men- 
tal disease or deficiency. 

Perhaps this is a gross misrepresentation of the facts. But it does 
seem strange that nowhere in the welter of paper of which the average 
clinical or hospital record is composed can one find a single form on which 
is given information concerning a patient’s mental condition that is in 
any way comparable to the breakdown of the physical condition contained 
on the physical examination form sheet. 

It was in an attempt to fill this obvious gap in clinical recording that 
the JNB Psychograph was drawn up in 1938. Since then, the graph has 
been revised and expanded four times; it is still not ideal in format. But 
experience has shown that on the present graph one can produce a vivid 
and concise breakdown of the mental pattern presented by a patient. 
And even if the form had no other merit, the fact that upon it one can 
compare point by point the intellectual and personality picture presented 
by a patient on two or more distinct and separate occasions would seem 
to justify its clinical employment. 


Description of the Form 


The sheet is divided vertically by a double red¹ line, and by five 
single black lines. The double vertical line is to be regarded as 
representing the hypothetical “average.” Everything to the right of that double 
line is arbitrarily assumed to be an abnormal or subnormal trait; 
everything to the left of the double line is postulated to be some quality or 
ability that is above average in calibre. In the majority of instances 
the degree of variation (to the right or the left) from the hypothetical 
average is represented by the actual distance of the descriptive term 
from the double line. 

¹ To avoid needless printing expense, black ink has been employed exclusively in 
this paper—and broken and solid lines have been used instead of lines of different colors. 

The form is divided horizontally into twenty-one major subdivisions. 
The first nine general headings (reading downward from line one) 
represent an analysis of the subject’s intellectual capacities and serve as a 
rough breakdown and elaboration of the figures on line ten—the 
psychometric scores themselves. The mere psycho-mechanical factors involved 
in the acquisition, consideration and solution of a given problem, of 
course, are more numerous than those considered here—and in many 
instances the factors tend to overlap, but no one yet has found a way to 
determine with any exactness the limits of a given faculty or the relative 
part it may play in the solution of a given problem. Nevertheless, the 
nine factors employed here are certainly of major importance in any 
general appraisal of intelligence—and any marked inter-factor weakness 
is usually of diagnostic significance, and not infrequently a prognostic 
aid as well. 

The second ten factors (lines number 11 to 20 inclusive) are intended 
to represent the major supplementary and amendatory factors—many of 
which go to make up that which is loosely called “personality”—that 
determine to so great a degree the extent to which an individual may 
make profitable use of his basic intelligence. 

The twenty-first line represents a summation and weighing of the 
twenty separate factors considered above it. 


Method of Use 


It has been found most convenient to employ a small “o” to indicate 
the degree of a particular trait or quality exhibited by the subject. To 
illustrate: if the subject’s reaction time is slower than average, the 
examiner will record that fact as follows: [illustration: an “o” entered 
at “Slow”]. If, on the other hand, it is more rapid than is usually the 
case, that fact will be recorded as: [illustration: an “o” entered at 
“Rapid”]. If the subject in question first shows average reaction 
time, but later, due to fatigue, it becomes progressively slower, the method 
of recording will be: [illustration: an “o” with an arrow]. The arrow is 
used to indicate (by its direction of point) trend tendency. 

The “o” may, of course, appear within more than one box on the same
line: for example, an individual may have powers of reasoning and judgment
that are above average so far as his ability to perform standard
psychometric tests is concerned, but at the same time he may have
definite delusions of persecution. In such a case the examiner will place
one “o” in the first box to the left of the double line (on line 3); and he
will place another “o” in the third box to the right of the double line.
He will also write “persecutory” in the empty box (third to the left of
the double line) to define the type of delusion shown by the subject.

On line 10 the examiner will enter the score made by the subject on 
whatever psychometric tests have been employed (the figures printed on 
line 10 are intended to represent mental age). The ratio between the 
scores obtained on tests that are essentially verbal in character and those 
that are non-verbal (or performance) in type is often of very real diag- 
nostic value—hence all are to be shown. 

If the Wechsler-Bellevue Scale has been used, the examiner will write
“W-B” immediately to the right of the words Psychometric Score; he will
then place the “o” at the appropriate point on the line to indicate the
Total Intelligence Quotient; place a “V” at the proper spot to indicate
the Verbal Score; a “P” at the right position to indicate the Performance
Score.

If he has used the Terman-Merrill Form “L,” he will write TML to
the immediate right of Psychometric Score; place a “B” at the proper
point on the line to indicate the basal mental age; place an “o” for the
final mental age figure; put an “R” (for range) at the proper place
on the line to indicate the highest year division in which the subject
scored any success.

If a performance scale such as the Cornell-Coxe is employed, the
examiner will use an “o” to denote the final mental age score; place an
“R” at the proper points to indicate the highest single test score made
and also the lowest single test score made (to indicate roughly the breadth
of inter-test score scatter).

The examiner will find it convenient to keep the graph before him
while he is interviewing a subject; to insert lightly (in pencil) “o’s” and
postils as information is acquired.

After the form has been completed, the pencilled “o’s” and remarks
can be erased and others typed in their stead. The “o’s” are then to be
joined by straight lines, as follows: “o’s” on the same line by a horizontal
line; the “o” farthest from the hypothetical average on a given line to
the “o” nearest the hypothetical average on the line next below by a
diagonal line, and so on downward through the graph.
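The joining procedure can be sketched in code. The sketch below is only one reading of the rule: marks on the same form line are joined horizontally, and the mark farthest from the average on one line is joined to the mark nearest the average on the next marked line below. The names `Marks`, `Point`, and `profile_segments`, and the convention of signed box offsets (negative to the left of the double line, positive to the right), are illustrative assumptions and are not part of the Psychograph itself.

```python
from typing import Dict, List, Tuple

# Assumed encoding: each "o" is a signed box offset from the double line
# (the hypothetical average). Negative = left, positive = right.
Marks = Dict[int, List[int]]   # form line number -> offsets of its "o" marks
Point = Tuple[int, int]        # (form line number, offset)


def profile_segments(marks: Marks) -> List[Tuple[Point, Point]]:
    """Return the straight segments that join the "o" marks into a profile."""
    segments: List[Tuple[Point, Point]] = []
    # Only form lines that actually carry at least one mark take part.
    lines = sorted(line for line, offsets in marks.items() if offsets)
    for i, line in enumerate(lines):
        offsets = sorted(marks[line])
        # Marks on the same form line are joined by a horizontal segment.
        for left, right in zip(offsets, offsets[1:]):
            segments.append(((line, left), (line, right)))
        # The mark farthest from the average is joined to the mark nearest
        # the average on the next marked line below.
        if i + 1 < len(lines):
            below = lines[i + 1]
            farthest = max(offsets, key=abs)
            nearest = min(marks[below], key=abs)
            segments.append(((line, farthest), (below, nearest)))
    return segments


# The example from the text: on line 3, one "o" in the first box to the left
# and one in the third box to the right; line 4 here is a made-up entry.
example = {3: [-1, 3], 4: [2]}
for start, end in profile_segments(example):
    print(start, "->", end)
```

Drawing a second profile in another ink, as the author suggests, would amount to calling the same function on a second set of marks and plotting both results on one sheet.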

If one will employ red ink for his first profile; black for his second, etc., 
several graphs can be drawn on the same sheet (which makes the form 
that much more valuable for comparative study) and one can quickly 
see the course of the patient’s mental recovery or decline. 













Discussion of General Headings 


There are 21 general headings with some 181 qualifying descriptive 
terms. There is neither need (nor space) here for a lengthy discussion 
of them all: the glossary in the manual will take care of that. Of course,
there must be standardization of definition, if the graphs are to mean
approximately the same thing to everyone who sees them. But it may 
be well to consider briefly certain of the general headings and the de- 
scriptive terms. 

It will be seen that on line one (Perception) consideration is given to 
certain physical factors. A moment’s cogitation will show that this is 
not so odd as it may seem at first sight: for example, if an individual has 
a severe and uncorrected bilateral myopic astigmatism, that condition 
will interfere with his perception; but provision must be made for record- 
ing that fact so that the flaw will not be thought to result from inherent 
mental deficiency. The type of physical defect will always be written 
in the third box to the left of the double vertical line. 

Line two requires no comment. 

Line three is concerned with both reasoning and judgment. This 
condensation was resorted to first in the interest of brevity; second, be- 
cause there is a surprising lack of unanimity of opinion as to whether the 
two terms are actually synonymous or represent two entirely different
faculties. As used here, Reasoning is postulated to be the ability to 
perform the mere mechanical solution of a problem—in terms of that 
particular problem only. Judgment is assumed to be the faculty that 
permits the solution to be an intelligent one. Definite variance between 
the two is to be noted on the line by use of appropriate initials. 

The term Insight (line four) is used both in the narrow sense “the
realization of one’s own abilities and limitations” and also in the very
different and broader sense “the ability to plan wisely over the long
range.” Any discrepancy between self-insight and long-range-insight
should be recorded in the appropriate empty box. 

Line five requires no definitive comment. Any marked discrepancy 
shown by the subject between his ability to repeat digits in the order 
given and in reverse order is usually significant and note of it is to be 
made. Likewise discrepancy between “auditory” and “visual” im- 
mediate recall is to be recorded. 

Line six is concerned with Concentration. A word concerning the
term “Distractibility”: it is postulated that there are three types of
distractibility: (1) the “shallow” type, which is most frequently seen in
mental deficiency and which is characterized by loss of ability to concentrate
in the presence of any simple disturbing element; (2) the “extroversive”
type, most frequently seen in hypomania and mania, in which
the pressure of ideas welling up within the subject’s mind prohibits his
concentrating on the problem at hand; (3) the “introversive” type, most
commonly seen in cases of dementia praecox. The “type” of “distractibility”
will be stated in the empty box to the left of the line.

Conceptual Level requires no definition. 

Drive is assumed literally to be the force that enables the subject to 
complete a task. There may be seen in one and the same subject at one 
and the same time scantness of drive, mild fatigue, and mild excitement. 

Line nine refers to mental reaction time—not physical. 

Line ten has already been discussed at considerable length. 

For line eleven nothing need be said except that at the Lynchburg
State Colony the subject’s score on the Time Test * determines the point
at which the “o” is placed on the subdivision line headed “Time.” The
attempt here is to use the term Orientation in a very broad sense.

Line 12 is concerned with an attempt at a rough appraisal of a sub- 
ject’s ability to meet sudden changes or adverse circumstance with 
equanimity. If actual “flight” is shown, the examiner will record the 
type in the box third to the left of the double vertical line. 

Two of the subheads under Verbal Expression are “flow” and “tone.”
“Flow” is to be construed as both volume and force. “Tone” is literally
the character or sound of the patient’s voice. The examiner will find it
helpful to enter in the empty box to the left remarks on the subject’s
spontaneity; his method of reply (direct and frank, oblique and evasive,
etc.) and so forth. “Question Block” is a term used to define a characteristic
frequently seen in persons who have undergone mental deterioration
as the result of epilepsy: the patient (to illustrate) in answering the
third of a series of questions will somehow still be answering a part of the 
first question asked him. 

The term Affect is familiar to all. 

Item 15 is an important supplementary factor, for after all, trite as 
the comment is, the subject’s attitude toward his contemporaries and the 
problems posed by his environment plays a paramount role in his battle 
for success and happiness. For the sake of brevity but two subheads 
have been employed: in most instances they will suffice. Two of the 
qualifying terms may need a brief word of definition: “Christus Complex”
is to be used to describe that relatively rare condition in which an individual,
sorely mishandled by Fate, seeks compensation by devoting
his life almost exclusively to the service of others (missionaries to the
contrary notwithstanding, this is an abnormal trait!). “Sanctus Complex”
is to be used to describe the cloak of bitter martyrdom that some
don in compensation for defeat. The examiner should explain an “o”


* An emergency test for the estimation of adult intelligence. 







placed in the box containing the term “indifferent” so that one may know
whether the indifference was, for instance, that of a catatonic praecox 
or that of a simple, happy imbecile, for they are by no means the same. 

In attempting to gauge his subject’s Empathy, the examiner must
do his utmost to evaluate his subject impartially in the light of what appeals
to the “man on the street,” not what happens to appeal to the
examiner!

If the subject has any cosmetic disfigurement or any physical dis- 
ability that might reduce his empathy and result in psychic decompensa- 
tion on his part, that fact is worthy of record. If the subject happens 
to have one of the better known facies (Mongoloid, Miserere, etc.) the 
examiner should note that fact in the empty box to the left. It has been 
rather aptly said (it may be noted in passing) that if one had to choose 
between possessing a high empathic index and a high degree of intelli- 
gence, one would do well to choose the former. 

Ambition (line 17) is construed to be the purpose without which the 
Drive would be simply so much force that would be of little use to its 
possessor. 

Ethico-Moral Development (line 18) is a sound differential diagnostic 
point: the three subheads are self-explanatory—any marked difference in 
developmental degree merits careful investigation. 

In estimating the subject’s Convulsive Threshold the examiner will 
attempt to establish—albeit rather roughly—the point at which the 
subject finds his surroundings so intolerable that he seeks escape from 
consciousness. An “o” in the first box to the right of the hypothetical
average (together with an appropriate explanatory statement in the 
empty box to the left of the line) is to be used to indicate the presence of 
onychophagy, habit spasms, etc. In this instance strict accuracy of 
definition is sacrificed in the interest of economy of space. The examiner 
must bear in mind that frank convulsions are caused by many things 
other than true epilepsy (if indeed there be such a thing as true epilepsy!). 

Item 20, General Behavior, cannot be appraised accurately in the
absence of a competent history.

Now comes the final question: “What is the subject’s general intelligence
level?” The “o” on this line is to represent the examiner’s opinion
of the actual working capacity of the subject. The individual whose mental 
age is approximately 12 years on standard psychometric tests, but who 
by reason of high empathic index, well developed ambition and drive, is 
actually anything but inferior, should be adjudged above average. He 
whose mental age on formal tests is considerably above average, but 
whose adjustment in society is poor indeed, may quite properly be ap- 
praised as below average. 

















[Sheet No. 1. JNB Psychograph (Copyright, 1941, by John N. Buck). Name: B.J.
Date: 29 Oct. 1941. Examiner: JNB. Impression: Definitely superior
intelligence; his personality is flawed by a number of paranoid
characteristics of whose existence he is to some degree conscious.]











[Sheet No. 2. JNB Psychograph (Copyright, 1941, by John N. Buck). Name: G. H.
Chart annotation: (patient in acute maniacal state). Impression: 19 June
1941: epilepsy, idiopathic, with ment. det. and disintegration of
personality.]








[JNB Psychograph sheet. Legible headings: Superior; People; Problems;
Ethico-Moral Development (Integrity, Veracity); General Behavior.]

































The conscientious mental examiner wishes to produce for the record 
as nearly a holistic picture of his subject’s mental condition as is possible. 
The fact that at the moment there is no known method by which he can 
weigh with mathematical exactitude the relative parts played by en- 
vironmental and organismic factors in the production of that picture will 
not deter him in his effort. 

The following illustrations may serve to give the reader some idea of 
what can be expected from the Psychograph: 

Sheet No. 1 is the graph of an out-patient; a man of definitely superior 
intelligence, whose personality shows mild paranoid warping. Sheet 
No. 2 shows two graphs of in-patient G. H.: the solid line with “o’s”
shows the picture he presented shortly after admission, when he was having
frequent epileptic seizures and was in an acute maniacal state; the
broken line with “x’s” shows the picture 17 days later. Sheet No. 3 has
on it two graphs of in-patient A.C. On admission he was in an acute 
psychotic state (the initial suspicion that this was the result of Dilantin 
intoxication was later confirmed). Ten days later and after Dilantin 
had been almost completely withdrawn he appeared to be an epileptic
who had undergone some intellectual deterioration and marked personality
disintegration of the schizoid type. The patient’s father states that this
last is the picture the patient presented for a number of years prior to the 
time at which Dilantin administration was begun. 


Summary 


By means of the Psychograph the mental examiner can do two things: 
first, he can present a breakdown of his analysis of an individual’s mental 
condition; second, he can record a graphic picture of that individual’s 
mental condition at more than one stage of his mental illness. 


Psychology in Kansas High Schools * 
W. H. Mikesell 


University of Wichita 


The Kansas Academy of Science has maintained for years an Educa- 
tional Trends Committee to ascertain the status of various scientific sub- 
jects as taught in the Kansas high schools for the purpose of bolstering 
them in case of a discovered weakness. 

It was the general opinion for a long time that psychology courses in 
Kansas high schools were in need of careful study. So in the academic 
year, 1940-1941, an Educational Trends Committee, of which the writer 
was chairman, was appointed to study the problem. From the State 
Board of Education, the information was obtained that 227 high schools 
of the state teach psychology either every year or in alternate years. 
A questionnaire was prepared which inquired chiefly into the preparation 
of high school teachers for instructing in psychology, whether special 
techniques were employed in teaching the subject, whether there was a 
demand for practical psychology, and whether there was a satisfactory 
textbook in the subject. 


Results of First Survey 
Four principal findings showed up in this first survey: 


First, That too often teachers in the subject had very little in- 
struction in the field, and still less training in imparting it. 

Second, That there was no careful technique followed in teaching 
the subject. Often, a textbook which would not be scientifically 
authorized, was employed. 

Third, That there was a pressing demand for practical psychology. 

Fourth, That the state-adopted psychology text by Uhl and Powers 
was not a psychology but a sociology textbook, and that there 
was, consequently, a very urgent need for a regular psychology 
high school textbook. 


When the report was submitted to the Academy, it voted to have the 
committee make a second survey to ascertain just what the high schools 
of the state wanted to be included in a psychology text. With this end 

* This is a report of a survey conducted by G. A. Kelly, H. E. Schammel, J. A. 
Glaze, Sister Anthony Payne, Margaret Newcomb, and W. H. Mikesell, Chairman. 






in mind, the committee prepared a second questionnaire to be submitted 
during the academic year, 1941-1942. Various general topics, such as 
Habit, Emotion, and others suggested by the high schools, were submitted 
each to be checked once if it should be included in a textbook and checked 
twice if it should be especially included. Then sub-topics, such as “How
habits are formed” and “How emotions should be controlled,” were submitted
to be checked in the same way.

Then it was thought wise to have the high schools submit problems
that should be treated in a text, such as personality adjustment, vocational
guidance, lack of confidence. Also the question was put as to
whether a high school psychology course should be a combination of text
and workbook.


Results of Second Survey 


It was difficult to organize the answers received, but the committee
felt that the simplest method was to list the topics in order of preference.
This was done by percentages.


I. In answer to the question as to what main topics should be included 
in a high school text, the following percentages are given: 


General Topics:

83% voted for Emotion
83% voted for Power of Environment
81% voted for How to deal properly with people
78% voted for Personality
77% voted for Self-Confidence
76% voted for What is Psychology
75% voted for Habit
73% voted for How to study
73% voted for Mal-adjustment
72% voted for Intelligence
72% voted for Memory
70% voted for Success
70% voted for Learning
69% voted for Leadership
68% voted for Vocational Guidance
68% voted for Thinking
68% voted for The Art of Conversation
67% voted for Anger
67% voted for Mind and Body
67% voted for Individual Differences
66% voted for Sensations
65% voted for Nervous System
62% voted for Object Fears
60% voted for Observation
59% voted for Association
53% voted for Irritability


II. In answer to the question as to what sub-topics should be included in 
a high school text, the following percentages are given: 


Sub-topics

92% voted for Use of Habit
81% voted for Place of psychology in life
81% voted for Rules for study
80% voted for How habits are formed
79% voted for Benefit and harm of emotion








79% voted for Methods of breaking a habit
77% voted for “Is environment more important than heredity?”
76% voted for “Can leadership be acquired?”
[illegible]% voted for The meaning of I.Q.
75% voted for “Can memory be improved?”
74% voted for Remedies for Controlling emotions
74% voted for Kinds of Learning
74% voted for Principles of Learning
73% voted for How emotions arise
73% voted for Inefficiency of study
73% voted for “What is learning?”
73% voted for Principles aiding memory
73% voted for “Can personality be acquired?”
73% voted for “What is success?”
73% voted for Some guides to success
72% voted for “What is emotion?”
72% voted for Definition of psychology
72% voted for Subjects to which rules of study apply
71% voted for Mental fitness for a job
71% voted for Principles for human control
71% voted for “What is self-confidence?”
71% voted for “What is memory?”
70% voted for Overcoming a lack of confidence
70% voted for How mind affects body
70% voted for Measuring personality
69% voted for The purpose of emotion
69% voted for Why people do not get along with others
69% voted for Origin of lack of outlets
69% voted for Various uses of the term “personality”
69% voted for “What is leadership?”
69% voted for Aids for the Art of conversation
68% voted for How body affects mind
68% voted for Difference between psychology and other studies
68% voted for What makes poor conversation
67% voted for Art of human control
67% voted for “Why do people get angry?”
67% voted for Remedies for anger-control
67% voted for Better ways of learning
67% voted for Principles of sound thinking
67% voted for “Can one improve his thinking?”
67% voted for “Why do not people think more?”
67% voted for Principles aiding observation
67% voted for Aids in acquiring leadership
67% voted for Conversational ability and pride
65% voted for “What is environment?”
65% voted for Misuse of memory
64% voted for Function of sense organs
64% voted for Touch sense
64% voted for Importance of individual differences
64% voted for “What makes for individual differences?”
63% voted for “What is intelligence?”
63% voted for Spread of intelligence in population
63% voted for “Is anger justified?”
63% voted for Significance of individual differences








63% voted for “What is maladjustment?”
63% voted for Aids for adjustment
62% voted for Where to secure job information
62% voted for “Do we know how we think?”
62% voted for “What is Observation?”
62% voted for Kinds of Object fears
61% voted for Kinds of sensations
61% voted for Some working principles of association
61% voted for Practical value of association
61% voted for Why people are poor observers
61% voted for “How do object fears arise?”
61% voted for Origin of mal-adjustment
61% voted for “Can we know the secret of success?”
60% voted for Vocational Tests
60% voted for Close relation of mind and body
60% voted for “What is meant by Association?”
60% voted for Remedies to overcome object fears
58% voted for Sound sensations
57% voted for Development of nervous system
57% voted for Sight sensations
57% voted for “Why do people become irritable?”
57% voted for Remedies of irritability
55% voted for Function of a nerve
55% voted for Parts of the nervous system
52% voted for Description of a Neuron
52% voted for Kinaesthetic sensations
48% voted for “How does environment affect the individual?”
48% voted for “Is irritability justified?”
37% voted for Little or fair amount of discussion of intelligence
36% voted for Other sense organs described


In answer to the question: “Other general topics and their particular
phase which you think should be discussed in a high school psychology
text,” the first ten additional topics and their votes and percentages are
here given, though the first was included under “F” in the first section,
but not perhaps sufficiently emphasized. Only a few schools suggested
additional topics.


12 or 9% voted for Inheritance
11 or 8% voted for Character
10 or 7% voted for Religion
9 or 7% voted for Reasoning
9 or 7% voted for Mental Hygiene
6 or 4% voted for Social Aspects of psychology relative to moral codes
6 or 4% voted for Marriage Preparation
6 or 4% voted for Etiquette
5 or 4% voted for Relation of Individual to society
5 or 4% voted for Drives and Instincts


III. In answer to the question as to what constitute important problems
for high school students, the following fifteen, and their votes and
percentages, are given. A problem receiving fewer than five votes
is not included in this list. These are not additional topics, but
subjects that imply serious difficulty for the student. Not all the
schools emphasized problems.


28 or 21% voted for Personality Adjustment
24 or 18% voted for Vocational Guidance
13 or 9% voted for Adjustment to Environment
12 or 9% voted for How to Study
12 or 9% voted for Emotions
9 or 7% voted for Boy-Girl Relations
9 or 7% voted for Mind and Body Relations
8 or 6% voted for Social Problems
6 or 4% voted for How we learn
6 or 4% voted for Self-confidence
6 or 4% voted for Leadership
6 or 4% voted for Reasoning
5 or 4% voted for Individual Differences
5 or 4% voted for Mental Hygiene
5 or 4% voted for Sex Education


IV. In answer to: “Do you think a high school psychology course should
be a combination of textbook and workbook?” we received the
following:

78 or 58% voted “Yes”
48 or 36% voted “No”


Conclusions from Second Survey 


Four principal ideas emerge from this second study: 

First, if we should take the first ten principal and sub-topics as placed 
in order of percentages, we see a very marked change of demand for con- 
tents of a psychology text as compared with psychology texts of long ago, 
and even of rather recent origin. Of course, we are speaking of college 
textbooks, and it may be said that the high schools present a different 
problem from that of colleges, yet it may be replied that the high school 
represents a greater development of a college trend. Let us compare 
this suggested content with contents of texts of years ago. 

If we take Titchener’s “A Text-Book of Psychology” published 23
years ago as typical of the old-time text, we find that about half of the 
book is devoted to sensations alone. Nothing is demanded of sensations 
in the first ten main or sub-topics chosen by these Kansas high schools. 


Titchener’s first ten topics which really cover the book include: 


1) Subject matter of Psychology
2) Sensation
3) Affection
4) Attention
5) Perception
6) Association
7) Memory and Imagination
8) Action
9) Emotion
10) Thought


While emotion is included in Titchener’s first ten topics, it is treated
in a very academic or impractical manner. 













If we take Seashore’s “Introduction to Psychology” published nine- 
teen years ago, we find his first ten topics include: 


1) Introduction
2) Sensory Experience
3) Color Vision
4) Visual Space
5) Hearing
6) Taste and Smell
7) Cutaneous, Motor and Static Senses
8) Traits of Perception
9) Attention
10) Mental Images


Here most of the first ten chapters are on sensations and no practical 
topic is included. 

If we take Warren and Carmichael, “Elements of Human Psychology,”
Revised Edition, published only twelve years ago, we find the first
ten topics include:


1) Survey of the Field
2) Structure of the Nervous System
3) Operation of the Nervous System
4) Mental Life
5) The Senses: Sight
6) The Senses: Hearing and Other Senses
7) Perception
8) Learning, Association and Habit
9) Imagination and Memory
10) Feeling, Emotion and Volition


Here are included the topics of emotion and habit which are in the 
first ten main and sub-topics of the high school preference, but they are 
discussed very little from the practical standpoint. 

If we take Cole’s “General Psychology” published as recently as
three years ago, we find the first ten topics include: 


1) Animism and brain psychology
2) The nervous system and behavior
3) Receptors
4) The Effectors
5) The problem of development
6) Emotion
7) Motivation
8) Learning
9) Perception
10) Thinking


Here is a fair sprinkling of practical topics but they show a predominance 
of the old impractical approach. 

However, with due respect to some of our most recent texts in first- 
course psychology, we find a practical approach stressed, including topics 
mentioned in the first ten preferences of the Kansas high schools, such 
as is found in the books by Husband, Gray, Pressey, Tiffin, Knight, 
McKinney, Ruch, and others. 

It may be said that the high schools should deal only with psychology 
in a preliminary way and the college texts should, therefore, of necessity 
contain topics not suggested for a high school text. But the recent 
Kansas Academy survey indicates that the high schools do not want only 







a preliminary but rather a self-sufficient treatment. The high schools 
want a psychology that will sufficiently guide a student in respect to his 
own difficulties. 

The second important idea emerging from this second survey is in 
respect to the additional topics suggested by the high schools of the
Sunflower State. It is true that only a small number of schools voted for
additional topics, yet this may be indicative of a growing tendency for 
discussion of certain topics now neglected. If we should select four out 
of the first ten, inheritance, religion, marriage preparation, drives and 
instincts, we find an interesting demand. Apparently some students want 
something practically offered in a psychology text on these subjects. 
But do not textbooks furnish answers to these topics? The answer is in 
the negative. Some students are much concerned over the problem of
how much inheritance plays in our lives, but it must be admitted that 
what authors say about this subject leaves the average individual perched 
still high in the air. When it comes to religion, a practical psychological 
chapter could be written that could be independent of denominational 
bias, offering distinct help to the pupil. But what author dares to write 
on this subject? A chapter on some cardinal principles of the psychology 
of marriage could also be furnished in a text, but authors generally steer 
clear of this “‘forbidden” topic. Students are also troubled about the 
origin of our drives, but they would have difficulty should they read most
of our treatises in this field. 

Third, the next important idea emerging from this survey is the em- 
phasis on certain problems to be solved. Personality adjustment heads 
the list. This topic alone offers a challenge to any author or authors of a 
high school text. Psychologists are constantly talking about adjustment, 
but they talk about something that for the most part still needs defining. 
Standards are still too much lacking in discussions of the subject, and of 
course criteria to meet these standards. Certain psychologists oppose the 
setting up of ideals. Nevertheless, psychologists who talk of adjustment 
must set up standards. Emotions, for instance, is still another problem 
that baffles students, but psychologists are still dodging an attempt to 
bring definite information and practical suggestions about this most 
troublesome issue to people. Little also is offered by authors in building 
up self-confidence and leadership, yet these are among the biggest con- 
cerns of a student. 

Fourth, the last important idea arising from this second survey, is in 
regard to the vote for a combination of text and workbook. The vote is 
perhaps large enough to warrant a publisher taking the risk of publishing 
one. We know that a workbook for a college course is of doubtful value 
as far as a publisher is concerned, but high schools probably want supplemental
material to a text more than colleges do. Since 78 out of 126
high schools that voted on the question wanted a workbook, this is a
sufficient proportion to encourage an author to proceed with one.
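The two figures quoted for the workbook vote, 58% in the results and 78 out of 126 here, rest on different bases, and a short calculation makes the difference explicit. The sketch below is illustrative only: the 78 “Yes” and 48 “No” votes come from the report, but the total of 134 responding schools is an assumption, chosen because it is consistent with 78 being reported as 58% and 48 as 36%. The report itself does not state the number of respondents.

```python
def percent(votes: int, base: int) -> float:
    """Return votes as a percentage of base."""
    return 100.0 * votes / base


YES_VOTES = 78   # from the survey report
NO_VOTES = 48    # from the survey report

# Schools that answered this particular question.
VOTED_ON_QUESTION = YES_VOTES + NO_VOTES

# Assumption: number of schools returning the questionnaire. The report does
# not give this figure; 134 is merely a base consistent with 78 = 58%.
ASSUMED_RESPONDENTS = 134

yes_share_of_voters = percent(YES_VOTES, VOTED_ON_QUESTION)
yes_share_of_respondents = percent(YES_VOTES, ASSUMED_RESPONDENTS)
no_share_of_respondents = percent(NO_VOTES, ASSUMED_RESPONDENTS)

print(f"Yes, of schools voting on the question: {yes_share_of_voters:.1f}%")
print(f"Yes, of assumed respondents: {yes_share_of_respondents:.1f}%")
print(f"No, of assumed respondents: {no_share_of_respondents:.1f}%")
```

On either base a majority favored a combined text and workbook; the share is somewhat larger when only the schools that voted on the question are counted.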


Summary 


From this survey, the main challenge derived is the need to prepare a 
high school psychology textbook for Kansas high schools which will 
offer practical aid in dealing with the special problems that beset the 
average high school student. It should probably be a thoroughly practical 
text. 





G. Stanley Hall, Psychologist and Educator 


Rose Zeligs 
Avondale School, Cincinnati, Ohio 


Do eminent men spring from unknown families? A search into the 
background of such men will usually show strong hereditary roots and 
favorable environmental forces which combine to shape their personality 
and mark their destiny. Forbears of such individuals often failed to 
achieve recognition because some of the necessary factors were lacking. 

A glimpse into the life of G. Stanley Hall will illustrate this theory. 
Hall’s physical and social heredity were conducive to growth and achieve- 
ment. He was born on February 1, 1846, in Ashfield, Franklin County, 
Massachusetts. His parents came of old New England stock which 
showed physical vitality, long life, leadership, and mentality that was 
above average. 

His father had given up teaching as a permanent vocation because he 
found farming more remunerative. Important men rarely have sufficient 
time to give to their wives and children. They are themselves the center 
of their universe and all else revolves about them. The contrary was true 
of Granville Bascom Hall. His capacity and striving for other things 
outside of his daily work found expression in the interest and education of 
his three children of whom Stanley was the eldest. 

The mother had also been a teacher. She was the daughter of a 
deacon, religious, quiet, gentle, and self-controlled, but with an inner 
fervor that was permitted expression only in her diaries. She too found 
an outlet for her inner cravings for self expression in the love and training 
she gave her children. The parents taught their children good manners, 
music, oratory, and many other things. The mother read to the family 
from the best books of that time. In addition to this, all the duties and 
activities of the farm gave opportunities for parental influences and atti- 
tudes to mold the characters of the children. 

The young Hall must have felt this striving of his parents to achieve, 
for the thing he dreaded most was mediocrity. On the farm the boy had 
an opportunity to gain first hand knowledge of nature. There too he 
developed an interest in the life and habits of animals. He received his 
early education in a country school. In 1863 he entered the senior year 
of Williston Seminary where at first he felt intensely self-conscious. 
From the seminary he went to Williams College where he came under the 
influence of the famous Mark Hopkins. He graduated in 1867 with a 
number of scholastic honors. 

A year spent in the Theological Seminary in New York City did not 
help Hall decide his future vocation. On borrowed money he spent three 
years in Europe studying physiology, theology, philosophy, and a number 
of other unrelated subjects. When he returned to New York he con- 
tinued with his studies at the Union Theological Seminary from which 
he took his B.D. degree in 1871. After teaching English in Antioch 
College and later at Harvard where he received his Ph.D. in psychology 
in 1878, he again left for Europe for the definite purpose of studying 
experimental physiological psychology under Wundt. 

Lectures on education and philosophy at Harvard and in psychology 
at the Johns Hopkins University were followed by a full-time professor- 
ship in psychology and pedagogy at Hopkins. There too he founded the 
first American psychological laboratory in 1883 and established the 
American Journal of Psychology in 1887. 

Among his students were John Dewey, J. McKeen Cattell, H. H. 
Burnham, G. W. Donaldson, E. C. Sanford, G. W. T. Patrick, Joseph 
Jastrow, and Woodrow Wilson. 

We can see how all the seemingly unrelated influences and experiences 
in Hall’s life culminated in a more lasting interest in psychology and 
education. He left Hopkins University to become the president of Clark 
University which Mr. Clark wished to found. Most of Hall’s labors in 
visiting institutions in Europe and the United States, in gathering books 
and materials, and in bringing together a very unusual faculty of eminent 
men were of no avail for Mr. Clark gave no more money to the University. 
The only available fund was $24,000.00, the income on the endowment. 

President Harper of the University of Chicago came to Worcester 
and engaged the majority of Hall’s staff at double their Clark salaries. 
In addition to his administrative work Hall now devoted some of his 
time to teaching. Mr. Clark, at his death in 1900, gave nearly all of his 
estate for the establishment of an undergraduate department at the 
University. Hall was president of Clark University for thirty-one years. 

How did all those forces, from which emerged Hall’s dynamic per- 
sonality, direct the trend of education and child psychology in the United 
States? It is difficult to define and delimit this man’s interests because 
of their tremendous scope. His wide reading and imagination, the influence 
of Spencer and Darwin, Hall’s earlier interest in animals together 
with his background in philosophy and psychology were adequate prepa- 
rations for the growth in child psychology. 

Both psychoanalysis and evolution appealed to him. The idea of the 
unconscious permitted his vivid imagination to build up countless possibilities 
from which to draw explanations for life and behavior. The 
doctrine of evolution gave him the idea of the constant unfolding of 
everything in nature. In the development of the child he saw what was 
for him complete evidence of the organic unity and inter-relatedness of 
life and matter. He felt that psychology had not sufficiently stressed 
genetic concepts which to him were so useful to explain the origin of 
psychic traits. The study of child development was therefore much 
more valuable and meaningful than some minute study of a cross section 
of the adult mind. The influence of the evolutionary theory is most 
clearly expressed in Hall’s methods and contribution to child study. 
The theory of Recapitulation was the nucleus of his thinking in his work 
on comparative and child psychology. 

But what was the status of child psychology before Hall? In the 
United States the child was neither seen nor heard in psychology. With 
the exception of Rousseau’s “Emile” and the contributions of Pestalozzi 
and later that of Froebel, little interest was shown in children. The 
scientific study of the child began near the end of the nineteenth century 
while actual experimentation did not begin until the twentieth century. 
Previous to 1872, when Darwin’s “Expression of the Emotions in Man 
and Animals” appeared, not more than a dozen major studies of the child 
existed. Before the end of the century the increase was manifold. 

The early methods were mostly biographical. The studies of Tiede- 
man, Darwin, the Scupins, the Sterns, and Shinn may be classed under 
that approach. Later, more specialized group studies were made in the 
language and drawing of children. 

Hall’s “Contents of Children’s Minds,” published in 1883, created 
wide comment and was translated into many languages. In this work 
as well as in many others he used the questionnaire method which has 
come to be associated with his name although it was previously used by 
Galton. The book proved to be a powerful stimulus to the study of child 
development and Hall was acclaimed the father of the child study movement 
in America. Hall’s greatest contribution is his study of adolescence, 
“Adolescence: Its Psychology, and Its Relations to Physiology, 
Anthropology, Sociology, Sex, Crime, Religion and Education,” published 
in 1904. Later he produced most of this material in a smaller book 
called “Youth, Its Education, Regimen, and Hygiene,” which came out 
in 1907. In this work he discussed pre-adolescence, the muscles and 
motor powers, industrial education, manual training, gymnastics, play, 
sports and games, faults, lies, and crimes, biographies of youth, the 
growth of social ideas, intellectual education and school work, the education 
of girls, and moral and religious training. An especially interesting 
chapter is that on biographies of youth in which the life histories of many 
interesting historical figures are given. 

Mention must be made of Hall’s fascinating auto-biography, “Life 
and Confessions of a Psychologist,” published a year before his death. 
It is here that one gets a concrete, realistic understanding of this active, 
energetic man, eager to accept ideas, who remained in some respects 
always an adolescent. 

When we realize the lack of interest shown in child psychology during 
the latter part of the nineteenth century, and the mass and mechanical 
methods used in the schools, and compare this with the vast body of 
material that exists today, we can perhaps realize the power of Hall’s 
inspiration. Probably no single person has fostered as much interest in 
the study of the child. Among his students who are now recognized as 
experts in this field are Gesell, Goddard, Mateer, Terman, Blanchard, 
and Kuhlmann. 

In comparing Hall’s list of studies with the contents of “The Handbook 
of Child Psychology” published by Clark University Press, it is 
amazing to find that a majority of the subjects discussed there were pre- 
viously treated by Hall. 

In 1909 Clark University commemorated the close of its second de- 
cennium by a series of conferences on child welfare. This resulted in the 
organization of many agencies concerned with the care and study of the 
child. 

Hall’s method of collecting data from all sources and with the aid of 
all kinds of untrained workers makes the validity and reliability of his 
material questionable. His knowledge of philosophy and vivid imagina- 
tion caused him to read into his data what was not always there and to 
fill in the necessary gaps. Of course, the extent of the subjects covered 
made objective, careful methods impossible. Though fact and fiction 
are found interwoven in his material it must not be judged by present 
standards but by the methods and techniques known when his studies 
were made. Since then much progress has been made in methods and 
techniques of research in child development. Yet in a way Hall’s faults 
were his virtues because they caused him to contribute much more than 
he could otherwise have done and thereby start a movement which has 
stimulated many workers. 

In conclusion it may truly be said that G. Stanley Hall was a man of 
varied interests, with a vivid and productive imagination, an indefatigable 
worker, an inspiring administrator, teacher and organizer. 
The extent of his contributions to the development of child study in 
America is measureless and he may well be called the father of child 
psychology in America. 







Bibliography 


Boring, E. G. A History of Experimental Psychology. New York: The Century Co., 
1929. 

Goodenough, F. L., and Anderson, J. Experimental Child Study. New York: The 
Century Co., 1931. 

Partridge, G. E. Genetic Philosophy of Education. Sturgis and Walton, 1912. 

Hall, G. S. Adolescence; Its Psychology and its Relation to Physiology, Anthropology, 
Sociology, Sex, Crime, Religion and Education. New York: D. Appleton and 
Co., 1904. 2 volumes, 1373 pp. 

Hall, G. S. Aspects of Child Life and Education. Boston: Ginn and Co., 1906. 379 pp. 

Hall, G. S. Life and Confessions of a Psychologist. New York: D. Appleton and Co., 
1923. 623 pp. 


Murchison, C., Editor. A Handbook of Child Psychology. Worcester: Clark Uni- 
versity Press, 1933. 

N.E.A. Proceedings, 1908. Pp. 948-952. 

Wilson, L. N. G. Stanley Hall. New York: G. E. Stechert and Co., 1914. Pp. 144. 








The Prediction of Differential Achievement in a 
Technological College 


William McGehee 
North Carolina State College of Agriculture and Engineering 


Investigators concerned with attempts to predict academic success 
by means of psychological tests have been, in the past, less interested in 
prediction of differential academic achievement than in prediction of 
general academic success. It would appear that data are needed, not 
only for the prognosis of general academic aptitude, but also for deter- 
mining an individual’s chances of success in a given academic field even 
though he lacks aptitude for some other academic pursuit. In other 
words, the counselor and the administrator need information concerning 
the possibility of a given student’s success in agronomy as contrasted 
with his chances in the field of chemical engineering. 

There has also been a tendency in many places to use tests compiled 
by well-known individuals in the field of psychometrics regardless of their 
value in a given situation. It is entirely possible for test X to be a good 
prognostic technique in college A and just a waste of time in college B. 

The purpose of this investigation is to examine the general and dif- 
ferential predictive value in a technological college of certain widely used 
psychological tests. The tests studied are the American Council Psy- 
chological Examination (1939 edition), the Cooperative English Test 
(Form OM), and the Cooperative Mathematics Test (Form P). Academic 
achievement is defined in this study as the grades made during 
the freshman year by 700 students enrolled in the North Carolina State 
College of Agriculture and Engineering. Grades were converted into a 
point-average ratio in terms of a formula¹ used by the registrar in this 
institution. 

Point-average ratios, as well as scores on each of the specified tests, 
were available on 700 of the 732 students who entered as freshmen in the 
school year of 1939-40. Table 1 gives the distribution of these students 


* The author expresses his appreciation of the aid of Mrs. Grace Z. Moen, Graduate 
Student, N. C. State College, for her assistance in collecting the data on which this 
study is based. This report was presented in part at the 1942 meeting of the Southern 
Society for Philosophy and Psychology, Nashville, Tennessee. 

¹ Point-average ratio $= 10 \times \dfrac{\text{Credit points} - 2\,(\text{hours failed})}{\text{Total number of credit hours} + \text{hours failed}} + 65$ 










according to major curricula, the mean scores of each curricula group on 
the various tests, and the means of the point-average ratio. As a rule, 
all freshmen enrolled in the same curricula take the same subjects, al- 
though the subjects required in the various curricula differ. The only 
subjects which all students are required to take regardless of curricula, 
are English, Military Science, and Physical Education. 


Table 1 


Distribution of Students by Curricula, Mean Scores on Psychological Tests, and 
Mean Point-Average of Each Curricula Group 


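[Table body only partly legible. Surviving column headings: Number of Students; Means of Tests (A.C.E., English, Math.). Surviving pairs of entries, in row order: 43.3, 51.5; 47.4, 57.0; 44.2, 52.2; 40.4, 52.0; All curricula: 45.4, 54.6.] 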


* Teachers of Vocational Agriculture and of Industrial Arts. 














Zero order and multiple correlations were computed between academic 
achievement and the test scores on each test for each curricula and for 
the entire class. All the coefficients of correlation satisfy the requirements 
of Fisher’s t test of significance. The coefficients are shown in 
Table 2. 


Table 2 


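Zero-Order and Multiple Coefficients of Correlation Between Point-Average Ratio 
and Test Scores by Curricula and for All Curricula 


Curricula                A.C.E.    English Test    Math. Test    All Tests 
Agriculture               ...         ...             .27           ... 
Engineering               ...         ...             .50           ... 
Textiles                  ...         ...             .52           ... 
Vocational education      ...         ...             .53           ... 
All curricula             ...         ...             .43           ... 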











The indices of forecasting efficiency * of these various coefficients of 
correlation give a clearer indication of their value in predicting academic 
success in the various curricula than is apparent in Table 2. The indices 
are given in Table 3. 
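
The index of forecasting efficiency is conventionally defined as E = 100(1 − √(1 − r²)), the percentage by which knowledge of the test score reduces the standard error of estimate. The sketch below assumes that definition; the function name is illustrative and does not come from the article.

```python
import math


def forecasting_efficiency(r: float) -> float:
    """Percentage reduction in the standard error of estimate for correlation r.

    Assumes the conventional definition E = 100 * (1 - sqrt(1 - r^2)).
    """
    if not -1.0 <= r <= 1.0:
        raise ValueError("r must lie between -1 and 1")
    return 100.0 * (1.0 - math.sqrt(1.0 - r * r))


# Illustrative inputs: correlations of the size reported in Table 2.
for r in (0.27, 0.50, 0.53):
    print(f"r = {r:.2f}  E = {forecasting_efficiency(r):.1f} per cent")
```

Because E grows much more slowly than r, a correlation of .50 removes only about an eighth of the error of prediction.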

It is obvious, from the data presented in Table 3, that the tests used 
in the investigation give the least reduction in the errors of prediction of 


* Guilford, J. P. Psychometric Methods. New York: McGraw-Hill Book Company, 
1936. 



















Table 3 


Index of Forecasting Efficiency (E) for the Coefficients of Correlation between Point- 
Average Ratio and Test Scores by Curricula and for All Curricula 








Curricula                A.C.E.    English Test    Math. Test    All Tests 
Agriculture               6.0         6.0             3.9           8.8 
Engineering              12.3        11.2            13.4          17.8 
Textiles                  6.0         7.9            14.6          16.5 
Vocational education     10.7        18.5            15.2          24.0 
All curricula            10.2        10.2             9.7          15.2 





academic success in the field of agriculture. Further, the A.C.E. and 
the English test seem to be of less value in predicting academic success in 
textiles and agriculture than in vocational education and engineering. 
It would seem, then, that the tests used are measuring less accurately 
those abilities required for academic success in the fields of textiles and 
agriculture than they are in the fields of engineering and of vocational 
education. 

The standard partial regression coefficients (Table 4) computed in 
securing the multiple R’s also give some indication of the relative value 
of each test in differential prediction. 








Table 4 

Standard Partial Regression Coefficients for the Various Tests for Each 
Curricula and for All Curricula 

Curricula                A.C.E.    English Test    Math. Test 
Agriculture               ...         ...             ... 
Engineering               ...         ...             ... 
Textiles                  .077        .247            .444 
Vocational education      .044        .395            .313 
All curricula             .214        .181            .241 





In terms of the data in Table 4, the A.C.E., in combination with the 
other two tests, has the least value in predicting academic success in 
Textiles, while the English and Mathematics tests seem to contribute 
least to prediction in the field of agriculture. 

It is difficult to draw clear-cut conclusions concerning the value of the 
tests in differential prediction. The data from the indices of forecasting 
efficiency indicate that more reliance can be placed on the A.C.E. and 
the English test in predicting academic success in vocational education 
and engineering than for predictions in the fields of textiles and agricul- 
ture. The tests used are of less value in predicting success in agriculture 
than in any other of the academic fields. Least error seems to be present 
in predicting success in the field of vocational education on the basis of 
scores on the tests used. 




The tests, in combination with each other, show a somewhat different 
picture when the standard partial regression coefficients are examined. 
The A.C.E. seems to carry less weight in predicting academic success in 
textiles and vocational education and the greatest weight in the fields of 
agriculture and engineering. The English and Mathematics tests seem 
to have most weight in predicting academic success in the fields of textiles 
and vocational agriculture. 

This difficulty in drawing clear-cut conclusions may arise from the 
fact that the tests used are not adequate predictive instruments for either 
general or differential success in the technological fields involved. In 
terms of E, the maximum error reduction in prediction (Table 3) is 
twenty-four per cent. In one instance, it drops to less than four per 
cent. In terms of prediction of general achievement in a technological 
college, a combination of the tests reduces the error of prediction only 
15.2 per cent. 

The coefficients of multiple determination based on the multiple R’s 
(Table 2) indicate that 83.2 per cent of factors determining success in 
agriculture are not accounted for by the tests used in this investigation. 
Similarly, 67.5 per cent of the factors in Engineering, 69.8 per cent of the 
factors in Textiles, 57.8 per cent of the factors in vocational education, 
and 71.9 per cent of the factors in all curricula are not accounted for by 
the tests used in this investigation. It is, perhaps, not to be wondered 
that no clear-cut differential prediction is found. 
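
The percentages quoted above are coefficients of non-determination, 1 − R², expressed per hundred. As a rough check, the sketch below recovers the multiple R implied by each figure and the corresponding index of forecasting efficiency, which can be compared with the All Tests column of Table 3. The dictionary and function names are illustrative assumptions; only the percentages come from the text.

```python
import math

# Percentages of factors not accounted for, as quoted in the text.
NON_DETERMINATION = {
    "Agriculture": 83.2,
    "Engineering": 67.5,
    "Textiles": 69.8,
    "Vocational education": 57.8,
    "All curricula": 71.9,
}


def multiple_r(non_determination_pct: float) -> float:
    """Multiple R implied by a coefficient of non-determination (1 - R^2), in per cent."""
    return math.sqrt(1.0 - non_determination_pct / 100.0)


def efficiency_from_non_determination(non_determination_pct: float) -> float:
    """Index of forecasting efficiency, 100 * (1 - sqrt(1 - R^2))."""
    return 100.0 * (1.0 - math.sqrt(non_determination_pct / 100.0))


for field, pct in NON_DETERMINATION.items():
    r = multiple_r(pct)
    e = efficiency_from_non_determination(pct)
    print(f"{field:22s} R = {r:.2f}  E = {e:.1f}")
```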

It has been asserted by counselors and others that, even though psy- 
chological tests fall down in mass prediction, they are of value in predict- 
ing the performance of individuals who stand either high or low on the 
tests. 

To test this hypothesis, the writer constructed scatter diagrams of 
the point averages and raw scores on the various tests of individuals who 
stood in the upper three deciles of each test, the middle four deciles, and 
the lower three deciles. If the hypothesis is correct, higher correlations 
should be obtained for the upper and lower three deciles than for the 
middle four deciles. Higher r’s should be obtained also for the upper 
three and the lower three deciles than for the total group. Only coeffi- 
cients for the upper, middle, and lower groups on the A.C.E. were com- 
puted, for it was obvious from inspection of the scatter diagrams that 
computed r’s would not support the hypothesis. The r’s for the A.C.E. 
are shown in Table 5. They are all small but show a little more than 
chance relationship. 
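
The decile comparison described above can be sketched as follows. The data layout (two parallel lists of test scores and point averages) and the function names are hypothetical; the article reports only the resulting r’s.

```python
import math


def pearson_r(xs, ys):
    """Product-moment correlation of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return sxy / math.sqrt(sxx * syy)


def decile_group_correlations(test_scores, point_averages):
    """Correlate test score with point average within the lower three,
    middle four, and upper three deciles of the test-score distribution."""
    pairs = sorted(zip(test_scores, point_averages))
    n = len(pairs)
    low_end, high_start = round(0.3 * n), round(0.7 * n)
    groups = {
        "lower three deciles": pairs[:low_end],
        "middle four deciles": pairs[low_end:high_start],
        "upper three deciles": pairs[high_start:],
    }
    return {
        name: pearson_r([p[0] for p in group], [p[1] for p in group])
        for name, group in groups.items()
    }
```

The hypothesis under test predicts larger coefficients in the two outer groups than in the middle group or in the class as a whole.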

It would seem, on the basis of the data presented, that the tests 
examined in this study have only a limited usefulness in predicting either 
general or differential success in a technological college. The writer is 



















Table 5 


Zero-Order Correlations Between the A.C.E. and Point-Average for the Upper 
Three, Middle Four, and Lower Three Deciles of the A.C.E. 








                    Correlation of 
                    Point-Average and 
Decile Groups       A.C.E.                 N 
8-10 Deciles          .20                 210 
4-7 Deciles           .15                 290 
1-3 Deciles           .15                 210 





well aware that the correlation coefficients are not out of line with those 
reported by other investigators.* He also realizes that factors, such as 
motivation, financial sufficiency, or even a girl back home, enter in deter- 
mining whether or not a college freshman is academically successful. 
He is well aware of the idiosyncrasies of college teachers in assigning 
marks. He, however, wonders if something else does not enter in making 
the value of these tests only slightly greater than the knowledge that the 
student has been graduated from a high school. 

It is entirely possible that the tests themselves, constructed on dif- 
ferent populations, fail to measure certain aptitudes and knowledges 
required for success in the particular situation in which they are being 
used. It is well known that tests which are successful in selecting workers 
in a given plant cannot be adopted unmodified for selecting workers in 
another plant, even though that plant may be manufacturing similar 
articles. The writer, therefore, suggests that a job analysis of the re- 
quirements for academic success in a given institution would reveal apti- 
tudes and knowledges needed there. It is entirely possible that tests 
constructed to measure the specific characteristics thus uncovered would 
give coefficients of non-determination of much less magnitude than 71.9. 


*Segel, David. ‘Prediction of Success in College.’ United States Department of 
the Interior, Bulletin No. 15, Washington, D. C., 1934. 


Interests of Business College Students: An Indirect 
Validation of the Strong Blank 


W. Leslie Barnette, Jr. 
Pace Institute, New York City 


Considerable work has been done, using the Strong Vocational Interest 
blank as a measuring rod, on the professional interests of college students. 
Such students have in general been oriented toward a career in teaching, 
engineering, medicine, law and the like. Little has been published on 
the validity of this blank as it relates to business interests. This report 
deals with this phase of the problem and, in addition, notes some relationships 
among professional business interests. 

The University of Minnesota investigators used the Strong blank on 
lower occupational levels, principally business ones, of workers in general 
(1). Their data showed such workers evidenced no marked interests in 
science or in occupations involving language facility or social service and 
that they made more As and Bs in the commercial group. The authors 
regarded this as an indirect validation of the blank, since the patterns 
Strong isolated for the professions occurred rarely among workers in 
general. The trend of the data reported here is similar. Other workers 
in guidance areas, for example Keys (3), have criticized the blank in that 
no scoring stencils have been developed for other commercial fields, 
particularly for trades that high school students enter. But the blank 
was never intended by Strong as a measure of such patterns; he limited 
himself almost entirely to what he regarded as professional interests. 

Many years ago when Strong first published his blank, he included 
several scoring keys for business occupations: life insurance sales, pur- 
chasing agent, real estate sales, accountant, CPA, etc. The inclusion of 
office clerk on the older blank (Form B) was an exception to the general 
professional level maintained. When the revised blank (Form M) was 
issued in 1938, a few of the business keys were dropped and others added. 
Office clerk has become office worker; vacuum cleaner sales was eliminated 
in favor of sales manager. All the more strictly “professional” occupations 
have been retained. The arrangement of the various occupations 
into three convenient groups for machine-scoring has unfortunately been 
shifted, so it is difficult to compare any one group of nine scores (i.e., 
Group I as the Psychological Corporation scorings go) from Form B to 
Form M. 



In connection with the course in applied psychology given by the 
author at a collegiate business school, one of the topics discussed is voca- 
tional guidance. At one stage the value of interest measurement is 
treated. Prior to this the students fill out the Strong blank. These are 
then machine-scored for the particular group of nine occupations in which 
their field of specialty falls. Such students are predominantly male, in 
their second year of college work, and are preparing to enter one of the 
following occupations: CPA, accountancy, advertising, selling. The 
majority are 19 years of age. After an assignment and class discussion 
on interest measurement their scores are then made available to them 
in an individual interview. The reader is reminded that most such 
students have, by their second school year, definitely decided on their 
business specialty. The mere fact they have survived three semesters of 
20 hours of work weekly, most of it concentrated in business fields, would 
indicate some stability of their interests. 


CPAs 


Turning to the CPA scores on Form B for the first group of nine 
occupations, it will be seen that CPA interests are similar to those of 
Purchasing Agent (Table 1). This is not surprising since both occupations 
entail similar duties. CPAs show commonality with Law interest, 
but more in the B range of scores than elsewhere. Of the other occupations, 
unrelated to the common view of business, few A’s are made. This 
is especially notable with the Life Insurance key; even though a “business” 
occupation, little similarity of interests exists between this and 
professional accountancy. The 5% As for Engineer appear reasonable 
since the two students represented by this percentage formerly had considered 
this as a career possibility. 


Table 1 
CPA Interests: Related & Unrelated Business Occupations. % A, B, C scores 

Form B, N = 39 
                        Purch                                   Life      Pers      Schl 
    CPA       Law       Agent     Engin     Phys      Adv       Ins       Mgr       Teach 
A   28        3         21        5*        -         -         3         3         - 
B   46        54        69        13        8         10        38        46        21 
C   26        43        10        82        92        90        59        51        79 

Form M, N = 54 
                                  Math-Ph                       Soc Sci 
    CPA       Law       Phys      Sci Teach Engin     Min       HS Tch    Math      Chem 
A   35        2         ...       13*       ...       -         13*       -         4* 
B   43        55        15        50        22        7         46        18        18 
C   12        43        79        37        70        93        41        82        78 

* See accompanying text. 




On Form M the grouping of the occupational keys has been altered. 
The Law trend shows up as on Form B. The two teaching keys yield a 
higher percentage of As than is expected, but here again several students 
admitted to previous interest in teaching. Thus, more As for these keys. 
A similar situation pertains to chemistry and engineering. It is interest- 
ing to note the complete lack of A’s on the Mathematician key; pro- 
fessional interests here have evidently little in common with CPA in- 
terests. The distribution of CPA scores on the Minister key bears out the 
earlier work of Strong and MacKenzie (8). 

On both forms then, CPA interest pattern is definitely distinguished 
from the other unrelated scoring keys. Inspection of the percentages of 
C scores also bears out this contention. 


Accountants 


The separation of “business” occupations from the other scoring keys 
is again shown in Table 2 dealing with Accountancy interest scores. 


Table 2 
Accountancy Interests: Related & Unrelated Occupations. % A, B, C scores 

Form B, N = 57 
              Off       YMCA      YMCA                          City Schl 
    Acct      Clrk      Sec       PhysDir   Math      Music     Supt      Physics   Dentist 
A   40        47        -         2         -         12*       -         2         2 
B   46        46        23        44        14        26        12        5         26 
C   14        7         77        54        86        62        88        93        72 

Form M, N = 25 
              Prod      Purch     Real      Life      Sales     Pers      Schl 
    Acct      Mgr       Agent     Est       Ins       Mgr       Mgr       Supt      Adv 
A   32        12        36        24        4         4*        8         -         - 
B   68        68        60        60        64        68        40        28        40 
C   -         20        4         16        32        28        52        72        60 

* See accompanying text. 





These are students enrolled in the course in accountancy and business 
administration rather than the CPA course. 

On Form B the keys for accountant and office clerk show considerable 
similarity. More A’s are earned in the latter, the less “professional” 
occupation; likewise there are fewer Cs. The only other occupational 
key producing a disproportionate quantity of A’s is musician, but expressed 
interests account for approximately half of the 12% here. On 
keys such as physics, dentist, etc., few A’s and many C’s are earned. 
This is as it should be if the scores are to separate occupational groups 
accurately. 

Form M gives an even clearer picture of the accountancy interest 
pattern. No C’s are earned on this main key. Presumably if the blank 
were administered and scored during the first semester of business school 
rather than during the fourth, more C’s would be in evidence. It is 
reasonable to assume the less interested students have dropped out by 
the end of the second year. Accountancy students have interests some- 
what similar to those of production manager, purchasing agent and real 
estate sales; they are less like advertiser, life insurance sales, school super- 
intendent. It is difficult to explain why there are so many A’s and so 
few C’s on the real estate key. Possibly the occurrence of legal forms, 
tax computations, investment and depreciation calculations, etc., with 
which both accountants and real estate brokers are involved, explains 
some of this overlap. But, again, the business occupations are separated 
from the non-business ones. 


CPAs versus Accountants 


The interests of accountants are not as professionalized as those of 
CPA’s. This is reflected in the selection of students as they enroll in 
either the CPA Preparatory or the Accountancy and Business Adminis- 
tration courses. The intelligence and entrance requirements are higher 
in the former; the course of study is a longer one. On the surface of 
things one would presume many CPA students should score high on 
accountancy interest, but not the reverse. 

Tabulation of results shows one individual out of a total of 20 CPA 
majors receives a lower score on Accountancy interest than on CPA interest. 
Most of the CPA’s score the same or higher on the accountancy 
key (mean percentage of 95). The data on the accountancy majors 
show the reverse. Few accountants scoring A turn up as A’s on CPA; 
78% score lower. The number of subjects is small here (20 CPA’s and 
28 accountants), but the trend of the data is obvious: few accountants 
have the professional interests of CPA’s. The corollary of this would 


be untrue. 
Law versus CPA 


Strong pointed out in 1927 (4) the future possibility of commonality 
between the Law and CPA keys. As the complexity of income and cor- 
poration tax laws has increased, more and more accountants have entered 
this field, formerly the more or less exclusive domain of the lawyers. It is 
to be expected that as competition between these two groups increases 
there would be a greater overlap of the two interest keys. Strong re- 
ported that 5% of lawyers rated A on CPA interest and that 6% of 
CPA’s rated A on the law key. The accompanying Table 3 presents the 
data on our student population. No law group is here involved. 

Table 3
Percentage of Total Group Scoring the Same on CPA and Law Keys. N = 93


          A    B+    B    B-    C

CPA      32    16   13    17   22
Law       2    13   22    20   43


This is about what Strong found in 1927 for the A-score distribution.
Husband (2, p. 87), quoting Strong, shows a similar trend with lawyers
scoring A on the CPA key. However there is considerable overlap in
terms of the B’s. This holds for both Forms B and M. Fifty-seven per cent
of CPA students receive A or B scores on Law interest. The degree of agreement
(rank-difference) between these two keys is closer on the old Form B
(ρ = .64, PE = .07, N = 39), with a smaller number of cases, than on the new
Form M (ρ = .33, PE = .08, N = 54). Averaging these coefficients by Fisher’s
method, admittedly a rather dubious procedure, produces a mean ρ
of .51 with a PE of .05 (93 cases).
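
The averaging step can be sketched as follows. This is an illustration only: the text does not say how the two coefficients were weighted, so both an unweighted mean and a mean weighted by N − 3 are shown, and applying Fisher's z transformation to rank-difference coefficients is itself an approximation. The function name `fisher_average` is hypothetical. The unweighted version comes out near .50 and the weighted version near .47, so neither reproduces the reported .51 exactly.

```python
import math

def fisher_average(coeffs, sizes=None):
    """Average correlation coefficients through Fisher's z transformation.

    If sizes is given, each z is weighted by N - 3 (an assumed, conventional
    weighting); otherwise the z values are averaged with equal weight.
    """
    zs = [math.atanh(r) for r in coeffs]
    weights = [1.0] * len(zs) if sizes is None else [n - 3 for n in sizes]
    z_bar = sum(w * z for w, z in zip(weights, zs)) / sum(weights)
    return math.tanh(z_bar)

# Coefficients and case counts as reported for Forms B and M.
rhos = [0.64, 0.33]
cases = [39, 54]

unweighted = fisher_average(rhos)
weighted = fisher_average(rhos, cases)
print(round(unweighted, 2), round(weighted, 2))
```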

The coefficient of contingency (5 × 5 fold classification) also produces
the value of .51 when both forms are combined. The chi-square value
for this table is found to be 32.68; P in this case turns out to be less than
.01, which passes the criterion of significance, there being less than one
chance in 100 that such results could have been obtained “by chance.”
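
The reported figures are consistent with the usual formula for the contingency coefficient, C = sqrt(χ² / (χ² + N)). The sketch below checks this with the values given in the text. The critical value of 32.000 for 16 degrees of freedom at the .01 level is taken from standard chi-square tables rather than from the article, and the function name is hypothetical.

```python
import math

def contingency_coefficient(chi_square, n):
    """Pearson's coefficient of contingency, C = sqrt(chi2 / (chi2 + N))."""
    return math.sqrt(chi_square / (chi_square + n))

chi_square = 32.68   # reported in the text
n_cases = 93         # both forms combined
rows = cols = 5      # 5 x 5 classification (A, B+, B, B-, C on each key)

c = contingency_coefficient(chi_square, n_cases)
degrees_of_freedom = (rows - 1) * (cols - 1)

# Tabled .01 critical value of chi-square for 16 degrees of freedom
# (an outside value, not stated in the article).
CRITICAL_01_DF16 = 32.000
significant_at_01 = chi_square > CRITICAL_01_DF16

print(round(c, 2), degrees_of_freedom, significant_at_01)
```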

Strong’s suggestion that the professional interests of CPA’s and 
Lawyers might become more alike seems to be substantiated as far as 
this student group will permit generalization. This does not mean the 
CPA key is invalidated; such an interest pattern is still distinct from, for 
example, that of minister, as Strong and MacKenzie (8) have shown and 
as our data also confirm. 


Advertising and Sales 


The number of cases in this group is too small for detailed treatment. 
There are only 31 blanks scored for Advertiser for both forms; 23 cases 
for one of the sales keys. Since the new Form M has altered the sales 
keys, in some cases only 11 blanks are available for analysis. Table 4 
presents the data. 


Table 4 
Score Distributions for Sales and Advertising Students 


Vac Clean Real Est Sales Life Ins 
Sales Sales rd Sales 
B B&M M 
12 23 1 ll 


25 35 55 
67 52 45 
8 13 — 




























Most of the student advertisers score A or B on this key. However
there are 26% C’s. This is explicable on the ground that the Strong key
was standardized on account executives of advertising agencies, which
Strong describes as “typical of the cream of the advertising profession”
(7). Not all students in advertising courses are preparing directly for
this type of work; some are principally interested in lay-out, art and
mechanical production, media, advertising research, etc. Nevertheless
the trend of the scores is consonant with the purpose of showing congeniality
of interests in student versus adult professional groups.

Inspection of other A’s earned by these students shows they are 
strictly limited to other related keys, as sales manager and real estate 
sales. Professional interests of advertisers would be expected to overlap 
here. Interestingly enough, very few student advertisers score A on life 
insurance sales—an area apparently where there is little communality. 
The B’s are scattered; they indicate in a less precise way the trend of the 
A’s. The greatest number of C’s piles up in such totally unrelated occu- 
pations as physician, engineer, CPA, school teacher, etc. 

In a similar way salesmanship courses attract divergent interests, but 
most individuals, although expecting to go through an apprenticeship in 
actual retail selling work, are looking forward to executive positions in 
selling. Inspection of the above table shows this aim in that the greatest 
percentages of A’s are obtained on keys for sales manager and life insur- 
ance sales. Both these keys produce no C’s with this student group. 
(Incidentally Strong also found (5) the score distributions for his standardization
group of life insurance salesmen were foreshortened at the low
end.) Apparently, then, these keys are more diagnostic for the average
student enrolled in collegiate sales courses than are the others prepared 
by Strong. Life insurance success is also dependent among other things 
on age; young men do not succeed here nearly as often as older men, so 
it is doubtful if the correlation between interests and sales production is 
an adequate measure (6). Most students scoring A and B on these sales 
keys get more such scores on advertising than they do, for example, on 
accountancy where there is little overlap. This would be expected in 
the light of common interest factors existing in these professions. The 
same trend was noted above with respect to student advertisers scoring 
high on sales keys. 

Thus these Strong keys are diagnostic of business student interests in 
these areas. This constitutes additional evidence of the blank’s validity 
in its separation of occupational groups one from another. 


Sure versus Not Sure 


The first page of the blank provides a space where the student indicates
his major occupational choice as well as his degree of certainty of
this choice. Several studies of a more experimental sort have shown
that an individual’s degree of certainty influences the accuracy of his 
report. Below is a tabulation of the interest scores for the student’s main 
occupational preference as related to his degree of certainty. 


Table 5 
Score Distributions of Sure and Not Sure. N = 231 


B+ B C 


35 24 25 
13 13 15 
2 2 2 
39 42 











The trend is clearest with the A and the B+ scores: most “sure”
students get such scores. Ninety-five cases of the total 231 are represented
here, which constitutes 41% of the whole sample. Against this
only 28 of the “not sure” individuals (12%) earn such scores. The
tendency is the same with the lower interest ratings, but not as clear. This
presents an interesting aspect of the blank’s validity in the sense that it
shows the measure is sensitive to such ideational differences in the individual.


Summary 


1. CPA interests are distinct from other unrelated scoring keys; of 
related “business” occupations, CPA interests are similar to those of 
lawyer and purchasing agent. 

2. Accountancy interest is clearly separated from other business 
occupations; such interests are most like those of office clerk, production 
manager, purchasing agent and real estate sales. The revised Form M 
gives a more definite picture of these interests than the older Form B. 

3. The expected tendency of CPA’s scoring at the same or higher 
level on accountancy interest is confirmed; most accountants, again ex- 
pected, receive a lower score on CPA interest than they do for their own 
specialty. 

4. There is considerable overlap on the CPA and law keys, especially 
with the B scores. (C = .51 and is statistically significant.) It appears 
these two interest patterns are becoming more alike than they were 
fifteen years ago. 

5. Students in advertising produce more C’s than expected, but this 
is explicable on the basis of their divergent fields of specialization. Other- 
wise advertisers are most like sales manager and real estate sales patterns. 
C’s pile up in unrelated professions. 
















6. For students in salesmanship courses the two keys for sales manager 
and life insurance sales appear the most diagnostic. There is consider- 
able overlap on the advertising key. 

7. The degree of the individual’s certainty as to his choice of career 
influences the score level obtained. 

8. The Strong blank is indirectly validated with respect to these 
business occupations in that these keys are sharply separated from other 
professional interests. Few business students earn A’s in unrelated 
business pursuits. This is additional evidence of the blank’s diagnostic 
value and helpfulness in student counseling. 


References 


1. Berman, I. R., Darley, J. G., and Paterson, D. G. Vocational Interest Scales: an 
analysis of three questionnaires in relation to occupational classification and 
employment status. Bull. Minn. Empl. Stabil. Res. Instit., 1934, 3, no. 5. 

2. Husband, R. W. Applied Psychology. New York: Harpers. 1934, pp. 654.

3. Keys, N. Measurement of interest and personality. Occupations, 1934, 12, 58-66.

4. Strong, E. K., Jr. Differentiation of CPAs from other occupational groups. J. educ.
Psychol., 1927, 18, 227-38.
5. ——. Interests and sales ability. Person. J., 1934, 13, 204-16.
6. ——. Relation of interest to ability in terms of life insurance interest scores and
sales production. Psychol. Bull., 1934, 31, 594.
7. ——. Manual for Vocational Interest blank for men. Stanford: Stanford Univ. 
Press. 1937. Pp. 16. 
8. Strong, E. K., Jr., and MacKenzie, H. Permanence of interests of adult men. 
J. soc. Psychol., 1930, 1, 152-9. 







Scholastic Achievement and Measured Interests 


Richard C. Crosby 
Miami University 


One of the most important problems in education today is that of 
measuring the motivating factors that contribute to success in school 
work. Since the author of the Preference Record, G. F. Kuder, states¹
that one of the purposes of his test is to do just this, it seemed worthwhile
to seek a relationship between interest, as measured by
the inventory, and achievement in various fields of college work. For this
investigation a group of 140 sophomores in the Colleges of Agriculture 
and Home Economics at Cornell University, who were taking a course in 
general psychology, were selected. 

Due to the piling up of grades about the central tendency only the 
extremes of the group were used. The students whose scores in Scientific 
Interest on the Preference Record exceeded the 90th percentile numbered 
30 or approximately 20 per cent of the group. These people represented 
a high interest in science. A corresponding number of low scientific 
interest students with scores below the 10th percentile were chosen at 
random. The grades in introductory chemistry and in introductory 
courses in biological science (biology, botany, or zoology), were obtained 
for the two groups. Mean grades were computed for high and low scientific
interest students and the significance of the difference between the mean
grades determined by the critical ratios.² The results are shown in
Table 1.

Table 1


Critical Ratios—Differences Between Means of Science Grades for
High and Low Interests in Science


                                      Mean         Mean
                                      Chemistry    Biological
                                      Grade        Grade
High scientific interest...........   79.0         85.9
Low scientific interest............   66.0         70.0
Difference.........................   13.0         15.9
Critical ratio.....................    7.6         12.2




¹ Kuder, G. F. Manual for the Preference Record. Chicago: U. of Chicago Bookstore,
1939. P. 1.
² Critical Ratio = (Difference between two averages) / (Standard Error of the difference).






















A critical ratio of more than 3.00 is considered to be indicative of the 
complete reliability of a difference between two averages. In both the 
case of introductory chemistry and that of introductory biological sciences 
the data show a statistically significant difference between the means. 
Although this technique does not measure cause and effect it does suggest 
considerable selective power for the instrument used. This is especially 
important because of the large actual difference between the average 
grades for high and low scientific interest students. There is real value 
in an instrument which will separate two groups whose average grades 
will differ by as much as 13 (chemistry) or 15.9 (biological science) points. 
However, it must be remembered that only the top and bottom 20 per 
cent of the group were included in this study and furthermore that the 
interests were measured after the various courses were taken and not 
before. 
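
The critical ratio is the difference between two means divided by the standard error of that difference. The article reports the differences and the ratios but not the standard errors, so the sketch below only recovers the standard errors those figures imply. The function names are hypothetical, and the implied values are approximate because the published figures are rounded.

```python
def critical_ratio(mean_high, mean_low, se_difference):
    """Difference between two means divided by the standard error of the difference."""
    return (mean_high - mean_low) / se_difference

def implied_se(mean_high, mean_low, ratio):
    """Standard error of the difference implied by a reported critical ratio."""
    return (mean_high - mean_low) / ratio

# Means and critical ratios as reported for the science grades.
se_chemistry = implied_se(79.0, 66.0, 7.6)
se_biology = implied_se(85.9, 70.0, 12.2)

print(round(se_chemistry, 2), round(se_biology, 2))
```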

In order to test the type of study described above in fields other than 
science another investigation was undertaken using the same method. 
Out of a group of 80 Hotel School students (sophomores) 26 were selected 
whose scores on the Computational division of the interest inventory 
exceeded the 90th percentile. Twenty-eight others from the same group 
were picked out because their computational interests fell below the 40th 
percentile. Average grades for each group in their freshman accounting 
course were then computed and the critical ratio for the difference be- 
tween the averages was found. Table 2 summarizes the results. 


Table 2


Critical Ratio—Difference Between Means of Accounting Grades
for High and Low Computational Interests


                                          Mean
                                          Accounting
                                          Grade
High computational interest............   83.0
Low computational interest.............   69.5
Difference.............................   13.5
Critical ratio.........................    6.9





In the case of the difference between the average accounting grades 
of students showing high and low computational interests we also find a 
definitely significant statistical difference. Again we have a large actual 
difference, 13.5 points, between the average grades of the two groups. 
This certainly suggests that the Preference Record may have considerable 
validity in educational guidance for selecting groups differing markedly 
in achievement in certain specific subjects; providing, of course, that 
similar results may be obtained previous to the taking of the courses. 




Finally, in order to determine whether there was an actual relation- 
ship between scores on the interest inventory and grades, the high and 
low interest groups were combined and the scores for both were correlated 
with their grades. This yielded the following Pearsonian coefficients: 


1. High and low scientific interest and chemistry grades.......... .64 ± .05
2. High and low scientific interest and biological grades.......... .67 ± .05
3. High and low computational interests and accounting grades.... .67 ± .05
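
The ± figures attached to these coefficients are presumably probable errors, as was conventional at the time. A minimal sketch of the usual formula, PE = .6745 (1 − r²) / √N, follows. The case counts are assumptions drawn from the group sizes given earlier (30 + 30 for the science comparisons, 26 + 28 for accounting); the article does not state how many students had grades in each course.

```python
import math

def probable_error_r(r, n):
    """Probable error of a Pearson coefficient: 0.6745 * (1 - r^2) / sqrt(N)."""
    return 0.6745 * (1 - r ** 2) / math.sqrt(n)

# (coefficient, assumed number of cases) for each comparison
comparisons = {
    "chemistry": (0.64, 60),
    "biological": (0.67, 60),
    "accounting": (0.67, 54),
}

probable_errors = {
    name: probable_error_r(r, n) for name, (r, n) in comparisons.items()
}
for name, pe in probable_errors.items():
    print(name, round(pe, 2))
```

Under these assumed case counts each value rounds to .05, in agreement with the figures listed.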


These results confirm Kuder’s studies * in that they show a positive 
relationship between interest in certain scales of the Preference Record 
and achievement in some school subjects. However, these correlations 
must be interpreted with caution because of the fact that only the ex- 
tremes of the groups are included. On the other hand, they compare 
very favorably with coefficients found between mental ability tests and 
achievement. They certainly reinforce the main implication of the entire 
study; namely, some measure of motivating factors such as interest is 
most essential to adequate prediction of achievement in academic work. 
It may well be that a combination of measures of motivation with meas- 
ures of ability will provide us with a more adequate means for predicting 
scholastic achievement than we now have at our disposal. 


* Kuder, G. F. Ibid.














Relation of Order of Appearance to Mark in Oral Tests * 


Norman J. Powell 


School of Business and Civic Administration, College of the City of New York


A crucial question in oral tests is concerned with the consistency with 
which rating standards are applied. Does the accident of the candidate’s 
order among those examined affect the rating received? Does the cir- 
cumstance that an applicant happens to appear first, or tenth, or last, 
or that he happens to follow a superior or inferior candidate influence 
the mark awarded to him? 

A priori conclusions cannot be formulated confidently about the 
problem. Perhaps, for example, a candidate who is examined after one 
who is grossly inferior tends to be given a high mark. Possibly the con- 
trast with extreme inferiority tends to transform even mediocrity to 
seeming superiority. If so, the candidate who is sufficiently fortunate 
to have an inept predecessor will have an unwarranted advantage over 
those of his competitors who follow superior applicants. The opposite 
hypothesis may also be tenable. Perhaps a poor applicant acts in the 
manner of a contagion source and tends to impart to those who follow 
him a “halo” of inferiority. 

The importance of the study lies in the necessity for rating different 
candidates in the same oral test on a uniform basis if there is to be accu- 
rate marking of the applicants. Oral tests are generally considered (by 
central public personnel agencies anyway) as useful and even essential for 
positions in which personal characteristics are important. One of the 
criticisms of the oral test has been that its subjectivity makes impossible 
the rating of all interviewees on an identical basis. 

The data for this analysis are taken from two New York City Civil 
Service Commission examinations, selected for the purpose of exploring 
different types of test situations: 

Junior Civil Service Examiner, New York City Civil Service Commission,
$2,400 per annum. The oral test, at which there were 52 candidates,
was given to the survivors of a 4½ hour open competitive written
test for which 850 persons had applied. Five persons served interchangeably
as examiners, two or three being present at any single oral
testing. The test was given over a period of six days.


* Statements made in this manuscript reflect the personal position of the writer and 
do not commit the New York City Civil Service Commission in any way. 




Supervisor, Grade 3, Department of Welfare, $3,000 per annum. The
oral test, at which there were 160 candidates, followed a 5½ hour written
test which eliminated 501 persons. The examining panel consisted of
three persons. The test was given over a period of 13 days, about 13
persons being summoned each day.

About the same examining procedure is followed in all New York 
City oral tests. Before the test is given, the examiners agree in confer- 
ence on the factors to be rated during the test (such as manner, judgment, 
speech, comprehension), determine the weights to be assigned to each 
characteristic and agree on the methods to be used (such as the nature of 
the questions and the standards to be employed). The test is then ad- 
ministered in accordance with the procedure determined by the examining 
committee. The oral test mark given to the candidate is the arithmetic 
mean of the individual examiners’ ratings. 


Table 1


Junior Civil Service Examiner Candidates’ Test Marks by Order of
Appearance in the Oral Test


Candidate      Median       Median          Median
Group          Oral Mark    Written Mark    Experience Mark

First 10       73           77
Second 10      79           81
Third 10       83           75
Fourth 10      86           76
Fifth 10       87           77











Maintenance of Standards Throughout the Oral Test 


Oral test marks may rise, fall, or remain quite constant during the
progress of the examination. In order to bring out possible trends clearly,
oral test scores have been grouped. It is indicated in Table 1 that
standards tended to fall as the Junior Civil Service Examiner oral test
went on. Whereas the median mark given to the first ten candidates is
73, the median mark of the last ten candidates is 87, a rise of 14 percentage
points. Were a curve to be drawn, it would be irregularly upward, but
would tend to flatten out near the end of the test. Assuming extrapolation
to be admissible, it seems probable that if the test had been continued,
greater uniformity of standards would have been achieved.

It may be argued that the trend toward a falling standard is only
apparent and is really the reflex of true differences in competence. Possibly
the candidates who appeared later in the examination were truly
superior to those examined earlier. To test this possibility, the experience
and written test marks of these applicants have been tabulated
in the same manner as their oral marks. The finding is unaffected. The
second group of applicants averages four per cent higher than the first,
then there is a decrease of six, then slow increases of one per cent for each 
of the last two groups. The experience marks, likewise, reveal a mixed 
trend. Neither the experience nor the written test marks duplicate the 
oral mark trend. 
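
The comparison of trends can be checked by taking successive differences between the group medians of Table 1. In the sketch below the second column of figures in that table is read as the written marks, an assumption that rests on its agreement with the changes described in the text (four up, six down, then one and one). The function name is hypothetical.

```python
def successive_changes(medians):
    """Change in the median mark from each group of candidates to the next."""
    return [later - earlier for earlier, later in zip(medians, medians[1:])]

# Group medians from Table 1, in order of appearance.
oral_medians = [73, 79, 83, 86, 87]
written_medians = [77, 81, 75, 76, 77]  # assumed to be the written-mark column

oral_changes = successive_changes(oral_medians)
written_changes = successive_changes(written_medians)

print(oral_changes)     # every step is a rise
print(written_changes)  # mixed rises and falls
```

The oral medians rise at every step while the second series does not, which is the contrast the text relies on.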

By and large, it appears that in the Junior Civil Service Examiner oral
test there was a decided tendency for oral marks to climb as the test went
on. The tendency, which seemed to be losing its force toward the end
of the test, is not explicable on the basis of such increases in ability as are
indicated by the written and experience ratings.

The situation in regard to fluctuation of standards in the Supervisor,
Grade 3 examination is shown in Table 2.


Table 2

Supervisor, Grade 3 Candidates’ Test Marks by Order of Appearance in the Oral Test

Candidate      Median       Median
Group          Oral Mark    Written Mark

First 20       75           81
Second 20      75           80
Third 20       81           80
Fourth 20      73           80
Fifth 20       77           80
Sixth 20       77           80
Seventh 20     79           80
Eighth 20      81           79


Unlike the Junior Civil Service Examiner oral marks, there is no smooth trend for marks to rise.
Ratings show a steep increase at the completion of about one-third the 
test, then a sharp decline, followed by steadiness and rises. Experience 
marks are not available but the written test scores show no such incon- 
sistencies. The ability of the candidates, as represented by their written 
test ratings, approximates a straight line in relation to the order in which 
the candidates were examined in the oral examination. 


The Influence of Quality of Predecessor upon Candidates’
Oral Marks 


The immediate problem is to ascertain whether the magnitude of the 
score assigned to a particular candidate influences the mark given to the 
applicant who follows him. The question is: Does the fact that a good 
or poor candidate has just been examined bear any relation to the rating 
awarded to the next person examined? 

To answer this question in the Junior Civil Service Examiner test, 
the 52 marks have been arranged in order of merit. Those persons with 
scores included among the highest 13 marks may be designated as superior.
Examinees rated as among the 13 lowest may be labeled poor.
The two intermediate groups may be called good and fair respectively. 
The verbal labels are for convenience only and refer to the competence of 
a candidate taken in relation to his competitors. In effect, the quality 
of a candidate is considered as his rank in the total group examined where 
each member of the group is classified on the basis of the quarter of the 
entire distribution into which his score falls. 

In order to simplify the interpretation of the data, a value of 4 has 
been assigned to superior marks, 3 to good, and 2 and 1 respectively to 
fair and poor ratings. The average numerical value has then been com- 
puted of all examinees classified according to the category of ability of 
their predecessors. In each case, for example, where a rating classified 
as superior was given, the nature of the rating of the next examinee was 
noted. An average of the latter marks was then taken. The averages 
of the scores of successor applicants are, then, a rough measure of the 
influence upon them of the character of the preceding candidates. 
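
The procedure just described can be sketched as follows: rank the marks, assign each candidate a value from 4 (superior) to 1 (poor) by quarter of the distribution, then average the values of the candidates who immediately follow each category. The marks in the example are hypothetical, since the individual ratings are not given in the article, and the handling of tied marks (earlier candidate ranked first) is an assumption.

```python
from collections import defaultdict

def quartile_values(marks):
    """Assign 4 (top quarter) down to 1 (bottom quarter) by rank order of merit."""
    n = len(marks)
    # Indices in descending order of mark; ties keep order of appearance.
    order = sorted(range(n), key=lambda i: marks[i], reverse=True)
    values = [0] * n
    for rank, index in enumerate(order):
        values[index] = 4 - (rank * 4) // n
    return values

def successor_averages(marks):
    """Average quality value of candidates grouped by their predecessor's value."""
    values = quartile_values(marks)
    followers = defaultdict(list)
    for previous, following in zip(values, values[1:]):
        followers[previous].append(following)
    return {quality: sum(v) / len(v) for quality, v in followers.items()}

# Hypothetical oral marks, listed in order of appearance.
marks = [70, 85, 90, 60, 75, 95, 65, 80]
print(quartile_values(marks))
print(successor_averages(marks))
```

If the predecessor's quality had no influence, the averages for the four categories would all lie near the general average of 2.5.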

The average quality value ascribed to candidates who follow poor 
examinees is, in the Junior Civil Service Examiner test, 2.6. Those who 
are examined immediately after fair candidates average 2.2. Followers 
after good applicants average 2.4 as do followers after persons who are 
superior. Apparently, the quality of a predecessor’s performance exerts 
no especial influence on the marks given to these candidates. None of 
the averages of the sub-groups deviated palpably or consistently from the 
general average of the entire group of candidates. Using the same method 
to analyze the Supervisor, Grade 3 test, the 160 marks have been divided 
into four sub-groups each with 40 scores. Here, the average quality 
value of examinees who happen to follow poor candidates is 2.27, the 
average of the fair sub-group is 2.27, but the averages of the next two 
sub-groups are 2.68 and 2.73. The absolute differences among the sub- 
group averages are small and roughly comparable in magnitude to those 
found in the Junior Civil Service Examiner test. However, the Super- 
visor Grade 3 test reveals a trend not observable in the other test. As 
the quality of the candidate increases, the quality of the next man tested 
tends to increase. The tendency is not great but it is quite consistent 
except that there seems to be no distinction between poor and fair appli- 
cants. The ratings of a pair of persons who are consecutively examined 
tend to resemble each other more closely than would be expected on the 
basis of pure chance. 


Conclusions 


It must be inferred that defects have been found in each of the two
oral tests studied. The accident of order in which applicants are examined
should not be correlated with the mark received if standards are uniformly
and rigidly applied.

Yet the defects found in the two tests are not identical. The suggestion
is that different oral tests possess quite distinct characteristics, but
that examiners ought always to be particularly alert to guard against
applying their standards sometimes loosely and sometimes rigorously,
and against permitting the relative merit or absence of merit of any examinee
to influence the mark of the person who is next among those
tested.





A Binocular Reading Test 


George Spache 
Friends Seminary and Brooklyn Friends School 


There has been some attention in remedial reading to the significance 
of eyedness or eye preference. Per se, eyedness has not been shown to be 
related markedly to reading achievement (6, 9, 11). This conclusion 
might have been expected in view of the fact that the tests of eyedness 
usually reveal merely the tendency to prefer one eye in sighting activities. 
The experiments have not shown whether eye preference, as measured by 
sighting, causes a pupil to read more with one eye than the other, a 
tendency which would be related to reading achievement. It is apparent 
that there is a real need for investigation of the influence of eyedness upon 
eye performance in reading. 

Better visual acuity in one eye than the other may cause the pupil to 
read more with the good eye. This may result in an undue strain on the 
better eye and some loss in the poorer through disuse (12). Gray (7) 
reports a case in which the left eye read 16 per cent more content in a 
given time than the right eye, and 9.3 per cent more than both eyes. In 
fact, the left eye read with 27.7 per cent fewer errors than both eyes. 
Such a loss of binocular coordination has been shown by Farris (2, 3, 4) 
to be associated with less than normal progress in reading. Thus the 
investigation of the extent of reading with each eye is particularly im- 
portant in the presence of any marked loss in visual acuity, binocular 
coordination or stereopsis, or in suspected instances of suspenopsia or 
fusional difficulties. A further demonstration of the necessity for this 
investigation is contained in the results of Gahagan (5) and Coons and 
Mathias (1). They show that eye preference and visual acuity are not 
always in accord. In some instances, loss of binocular coordination may 
result in a preference for the poorer eye with consequent further strain 
on its capacity and inferior reading because of the visual hindrance. 

At least one diagnostician has considered the tendency to prefer one 
or the other eye in reading (10). Mrs. Robinson, of the Orthogenic 
School of the University of Chicago, has her pupils read one paragraph 
of an appropriate set of the Gray’s Oral Reading Check Tests with each 
separate eye, and a third paragraph with both eyes.¹ We have constructed
a similar set of test cards which may be used in any ordinary
stereoscope. The first paragraph of each set of Gray’s tests (8) appears
only on the left half of the test card; the second paragraph, on the right
half of a second card. The third paragraph is duplicated on both halves
of a third test card. The stereoscope is adjusted to the child’s vision by
setting the card holder at a point at which the child secures good fusion.
This point of fusion is determined by allowing the pupil to adjust the
stereoscope until he secures clarity and fusion with several ordinary
stereoscope pictures. We believe the technic valuable since it yields
three comparable measures of oral reading, permits observation of oral
reading errors and habits, and determines reading efficiency of each and
both eyes independently.

¹ Test cards for this purpose may be obtained from the Keystone View Co. of Meadville,
Pa.

In our first few administrations of Robinson’s technic, it became 
apparent that marked differences may exist among monocular and 
binocular reading abilities. In the case of Jimmy H., for example, the 
test results were: Set II, first paragraph read with the left eye in 130 
seconds with 12 errors. The second paragraph, read with the right eye, 
took 118 seconds with 8 errors. The third paragraph of the same set, 
read with both eyes, took 120 seconds with 9 errors. Reading with the 
left eye is apparently slower and more inaccurate than reading with the 
right or both eyes. 
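
Jimmy's three readings can be set side by side in a few lines. The sketch below uses only the times and error counts reported above; the variable names are hypothetical, and the comparison is rough because each reading used a different paragraph of the same set.

```python
# (seconds, errors) for each reading condition, as reported for Jimmy H.
readings = {
    "left": (130, 12),
    "right": (118, 8),
    "both": (120, 9),
}

slowest = max(readings, key=lambda eye: readings[eye][0])
most_errors = max(readings, key=lambda eye: readings[eye][1])

# Extra time taken by the left eye relative to the right, in per cent.
left_seconds = readings["left"][0]
right_seconds = readings["right"][0]
extra_time_pct = 100 * (left_seconds - right_seconds) / right_seconds

print(slowest, most_errors, round(extra_time_pct, 1))
```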

The differences between Jimmy’s monocular and binocular reading 
skills are particularly important in view of his visual acuity. Use of the 
Keystone visual acuity card yielded R 20/60, L 20/122, Both 20/60, or 
70, 40 and 70 per cent respectively of normal acuity. As might logically 
be expected, the efficiency of monocular and binocular reading is directly 
related to visual acuity, that is the poorer reading is done with the 
poorer eye. 

During the testing, Jimmy made a number of attempts to sight or 
read with the left eye, both in monocular and binocular reading. In 
view of the tendency to prefer this poorer eye, we were anxious to deter- 
mine how much actual reading was being accomplished by each eye in 
what was supposed to be a binocular reading. It was important to learn 
whether the preference for reading with the poorer eye was causing the 
boy to do more actual reading with this eye. The testing by means of 
the separate Gray’s Oral Check Test paragraphs does not reveal the pro- 
portion of effort with each eye in reading that we presume is being done 
with both eyes. 

A certain amount of alternating vision, in which perception is achieved 
first with one eye and then with the other, is normal, according to Selzer 
(11) and others. But in instances of suspenopsia, when vision of one 
eye is suppressed in preference for the other, much more or even all of
the reading may be done with the preferred eye. When, as in Jimmy’s
case, the eye performing the major portion of the reading is much inferior 
in acuity, the matter is obviously important in understanding the pupil’s 
reading difficulties. 

We constructed two tests which we believe will reveal the extent of 
actual reading with each eye in binocular reading. Binocular Reading 
Test A is an original paragraph which is intended for pupils of primitive 
reading abilities. Binocular Reading Test B is an adaptation of the 
fourth paragraph of Set III of Gray’s tests and is intended for pupils with 
reading abilities above the first grade. Whenever possible, we employ 
both tests with each individual. 


Binocular Reading Test A 


My Red Car My Big Car 
My red car My big 
See big See my 


not go car can go 
red car 

not go away can go now 
The red car broken The big 


The big man has fixed The 


car is broken here 
tall man fixed 


the red car now big car 


The red car The big car 

go now can go away 
I will go in big I go the 
My red car My big 
will go home now go home 


red car 
car and I 


Binocular Reading Test B


Once I saw a queen and her                 I saw a queen and her
     escorts pass through the         many escorts pass through the
main street of a     city. It              street of a big city. It
was a very           sight            was a      beautiful sight
because they were     dressed         because they were all dressed
in pretty bright colors. The          in               colors.
men clad in        scarlet,           men clad in bright scarlet,
with            plumes in their       with long white plumes in their
caps, were the leaders. A com-        caps, were the leaders. A com-
pany in yellow, with big blue         pany            with
plumes, came next in line.            plumes, came next
Another company clad in dark          Another company clad in
green with      scarlet plumes        green with long scarlet plumes
followed. The air was filled          followed. The air was filled
with              music. Cries        with the sound of music. Cries
from the many people were heard.      from the      people were heard.
Great numbers           looked              numbers of people looked
down from the windows and threw       down from     windows and threw
flowers. When     the companies       flowers. When all the companies
came to the place for the con-        came to the place for the con-
test, they passed         before      test, they passed quickly before
the        queen.                     the pretty queen.

The queen sat on a lofty              The queen sat on a
throne in a central location. Be-     throne in a central location. Be-
hind her stood a      attractive      hind her stood a very attractive
girl, who was tall, and               girl, who was slender and
graceful. She had beautiful gold-     graceful. She had           gold-
en hair, and her eyes sparkled        en hair, and her eyes sparkled
like        jewels. She was very      like bright jewels. She was
different from the queen, who had     different from the queen, who had
black hair, dark skin and             black hair, dark skin and soft
brown eyes.                           brown eyes.





We have attempted to adapt the paragraphs so that a relatively con- 
tinuous story is available for either or both eyes. Each paragraph is 
typed on one half of a stereoscope card, with due care for alignment so 
that the normal reader may readily fuse and read binocularly. Thus the 
paragraphs of Binocular Reading Test A, as given above, would appear 
on opposite halves of the test card. In each binocular reading test, 
twenty words are presented to each eye that are not presented to the 
other eye. To determine the efficiency of effort with each eye, we com- 
pute the per cent of the total number of test words read with each eye. 
For example, in Test B, Diana B. read 33 of the test words, 17 or 52 
per cent with the right eye and 16 or 48 per cent with the left. Seven of 
the test words were omitted. 
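
The scoring arithmetic just described can be sketched in a few lines. The function and field names are hypothetical; the only figures taken from the text are the twenty test words per eye and Diana B.'s counts.

```python
# A minimal sketch of the scoring described in the text: the share of the
# test words actually read that was read with each eye. Names are
# illustrative and not part of the published test.

def eye_shares(right_words: int, left_words: int, words_per_eye: int = 20) -> dict:
    """Return per cent of read test words per eye and the number omitted."""
    for count in (right_words, left_words):
        if not 0 <= count <= words_per_eye:
            raise ValueError("word counts must lie between 0 and words_per_eye")
    total_read = right_words + left_words
    omitted = 2 * words_per_eye - total_read
    if total_read == 0:
        # No test words read at all, so no share can be computed.
        return {"right_pct": 0.0, "left_pct": 0.0, "omitted": omitted}
    return {
        "right_pct": 100.0 * right_words / total_read,
        "left_pct": 100.0 * left_words / total_read,
        "omitted": omitted,
    }


if __name__ == "__main__":
    # Diana B. on Test B: 17 test words with the right eye, 16 with the left.
    diana = eye_shares(17, 16)
    print(round(diana["right_pct"]), round(diana["left_pct"]), diana["omitted"])
```

Note that the per cent is taken on the words read (33 for Diana B.), not on the forty test words offered, which is why the omitted words are reported separately.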

In Jimmy’s case, the difference between the efficiency of the separate
eyes is quite marked. He read only 20 test words of “My Big Red Car.”
These twenty words were those offered to the left eye. He read none
of the words offered to the right eye. This confirmed the previous impression
that he preferred to read with the left eye, the inferior eye. Both
visual acuity and reading skill are poorer in this preferred eye with which
Jimmy does most, if not all, of his reading. This is ample explanation
for his difficulties in reading.

We have attempted to construct a test for determining the relative
efficiency of each eye in an opportunity for binocular reading. We are
not yet ready to interpret all the results that the binocular reading tests 
yield, nor are we ready to publish any data on validity or reliability. 
But, as demonstrated in Jimmy’s case, the tests do aid in interpreting the 
interrelationship of eyedness, visual acuity and reading ability. We have 
also found the technic helpful in cases of alternating vision, suspenopsia, 
strabismus and faulty fusion. 


Bibliography


1. Coons, J. C., and Mathias, R. J. “Eye and Hand Preference Tendencies.” Ped.
   Sem. and J. gen. Psychol., 1928, 35, 629-632.

2. Farris, L. P. “Visual Defects as Factors Influencing Achievement in Reading.”
   Calif. J. sec. Educ., 1934, 10, 50-51.

3. Farris, L. P. “Visual Defects as Factors Influencing Achievement in Reading.”
   Clearing House, 1934, 9, 226-228.

4. Farris, L. P. “Visual Defects as Factors Influencing Achievement in Reading.”
   J. exp. Educ., 1936, 5, 58-66.

5. Gahagan, L. “Visual Dominance-Acuity Relationships.” J. gen. Psychol., 1933,
   9, 455-459.

6. Gates, Arthur I., and Bond, Guy L. “Relation of Handedness, Eye-Sighting and
   Acuity Dominance to Reading.” J. educ. Psychol., 1936, 27, 450-456.

7. Gray, William S. “Problems of Reading Disabilities Requiring Scientific Study.”
   Elem. Eng. Rev., 1935, 12, 96-100.

8. Gray, William S. Gray’s Oral Reading Check Tests. Bloomington, Illinois: Public
   School Publishing Co.

9. Imus, H. A., Rothney, J. W. M., and Bear, R. M. An Evaluation of Visual Factors
   in Reading. Hanover, New Hampshire: Dartmouth College, 1938.

10. Robinson, Helen M. “The Study of Disabilities in Reading.” Elem. Sch. J.,
    1937, 38, 15-28.

11. Selzer, Chas. A. Lateral Dominance and Visual Fusion. Harv. Monogr. in Educ.,
    1933, 12.

12. Spache, George. “The Role of Visual Defects in Reading and Spelling Disabilities.”
    Amer. J. Orthopsychiat., 1940, 10, 229-238.











News and Notes 


The Special Service Division, SOS, U. S. Army has recently issued a
Catalog of the Army Institute. The bulletin describes the Army Institute 
in detail, giving to the soldier full information as to how he may continue 
his education at the high school level and at the college level by means of 
over 700 correspondence courses. Seventy-six colleges and universities 
have agreed to make their extension courses available to the enlisted 
personnel of the Army. Further information may be obtained from the 
Commandant, Army Institute, Madison, Wisconsin. 





Dr. William S. Learned reports the results of comparing college marks
and Graduate Record Examination test scores for 1,659 students in
eleven Eastern liberal arts colleges and Dr. Charles R. Langmuir describes
the Graduate Record Examination Testing Program from 1936 to
the present in the Thirty-Seventh Annual Report of the Carnegie Foundation
for the Advancement of Teaching issued on January 18, 1943.





The January 15, 1943, issue of U. S. Office of Education’s Education
for Victory describes a project to obtain an inventory of every high school 
junior and senior in the State of Indiana by means of tests and personal 
ratings. 





The American Youth Commission announces that Post-war Youth 
Employment by Paul T. David will be published in March, 1943. 





The American Association for Applied Psychology newsletter of 
January 25, 1943, contained the following announcement: “To those of 
our members who are especially interested in post-war planning, Dr. 
Gardner Murphy of the College of the City of New York extends an 
invitation to cooperate in a study of the psychological aspects of planning 
the peace now in process by the Society for the Psychological Study of 
Social Issues. Dr. Murphy writes: ‘It is perfectly obvious that our group 
will gradually lose its identity in a larger psychological and social science 
project, and that is as it should be. We feel that, for the present, we are 
doing something worthwhile in getting psychologists oriented towards 
real post-war service. The more intimate the cooperation between
various groups, the better we’ll like it.’ Write to Dr. Murphy directly
if you are interested in cooperating in this project.” 





The American Association for Applied Psychology announces the 
launching of a new monograph series, entitled Applied Psychology Mono- 
graphs with Herbert S. Conrad, College Entrance Examination Board, 
Princeton, N. J., as Editor. The series will be published by Stanford 
University Press. Inquiries concerning the new series and manuscripts 
for publication should be addressed to Dr. Conrad. 





Dr. Mark A. May, as chairman of the Committee on Motion Pictures 
of the American Council on Education, announces the publication of 
Films for America at War. This document supplies for the teacher, or 
leader of adult groups, essential information on 114 war-related films. It 
forms the first supplement to Selected Educational Motion Pictures: A 
Descriptive Encyclopedia. 











Book Reviews 


Luckiesh, M., and Moss, F. K. Reading as a visual task. New York: 
D. Van Nostrand, 1942. Pp. x + 428. $5.00. 


The hygiene of reading is concerned with those factors which affect 
the ease, comfort and speed with which reading is done. Obviously an 
effective and healthful working environment is desirable for any person 
doing visual work. This book deals “with factors which are readily
controllable and which can be utilized toward making seeing easier.”
Major concern is with visibility and readability of print. The techniques 
of measurement employed are outlined, results presented, and specifica- 
tions for optimal readability given. The experimental studies which 
form the basis for the discussion are chiefly the authors’ work although
there is an abundance of important material written by German, English 
and American scientists. Much of this material, however, does not 
agree with what Luckiesh and Moss present. 

The terms visibility and readability of print are preferred to legibility. 
By means of the authors’ visibility meter, relative visibility of various 
typographical factors and illumination effects were measured. Read- 
ability is defined as ease of seeing in reading. It was appraised by meas- 
urement of rate of blinking, muscular tension, heart rate, fatigue of ex- 
trinsic eye muscles, eye movements, pupillary changes, and reading rate. 

The authors admit that high visibility does not always guarantee high 
readability. Obviously visibility is a factor when brightness contrast 
between ink and paper is less than optimum. Contrary to the authors’ 
view, however, relative visibility is important only when discrimination 
of details is difficult. With type sizes above 8 point, for example, relative 
visibility is unimportant when line width and leading are optimal. The 
main reason that type sizes larger than 12 point are rejected by the authors
is that their use is impractical, although such type is more visible. In reality
other factors reduce readability when larger type sizes are employed. In 
general, visibility of type is accepted as an important measure only if it 
agrees with the authors’ favored measure of readability. 

The measure of readability preferred and most emphasized by 
Luckiesh and Moss is rate of blinking. The number of involuntary blinks
occurring during a 5 minute period of reading or at the beginning and end
of a longer period was counted by direct observation. It is assumed that
this reveals the degree of tension or state of fatigue of the subject. Other
experimentalists have concluded that the rate of blinking is not a valid
index of visual fatigue. Its real function is still debatable. It has also
been found that heart rate is a questionable measure of ease of reading. 
There is, therefore, considerable doubt concerning the validity of these 
techniques as measures of ease of seeing. 

The authors go to great lengths in an attempt to prove that rate of 
reading is of little or no value as a criterion of readability. An examina- 
tion of their method for measuring rate reveals an extreme lack of experi- 
mental controls. The subjects were instructed to read at their normal 
rate. This may lead to dawdling. In the single instance where maximal 
speed of reading was measured the artificial situation of a moving text 
of unequated difficulty precludes application to normal reading situations. 
In reading at normal rates, neither comparable forms of reading text nor 
any measure of comprehension was employed. It has been shown by 
others that “reading” without comprehension results in a very consistent
pace, i.e., rate might be unaltered by changes in typography. Furthermore,
as few as 10 subjects are used for some comparisons. It is obvious
that speed of reading measured in this manner is of dubious value as a 
measure of readability. Other investigators have demonstrated that 
speed of reading is a sensitive measure of readability. It is of interest 
that the authors do emphasize eye movements as a measure of read- 
ability, apparently unaware that recording eye movements is another 
way of measuring speed of reading. 

Noteworthy examples of erroneous or inconsistent techniques and 
interpretations follow: (1) In most comparisons the geometric mean is 
used. It is questionable whether the geometric mean is appropriate for 
the data presented. (2) Data derived from study of book type is errone- 
ously applied to visibility of newspaper type (p. 136). (3) It is stated 
that 8 point type is not uncommon in textbooks (p. 136). A survey of 
1000 text-books reveals that 8 point type occurs in less than one-half 
of one per cent. (4) Erroneous inferences concerning influence of mar- 
gins and inter-column spacing are made without data. (5) On page 113 
a change of 5 to 8 per cent is minimized; on pages 160-61, 5 to 10 per
cent assumes importance, and on page 180 “no great significance should
be attached to differentials less than one or two per cent.” But on page
225, a 3.8 per cent difference is considered not statistically significant. 
And on page 151, one and two per cent differences (6 vs. 10, 6 vs. 12 point 
type) are considered to indicate differences in readability. (6) In certain 
instances a five minute test period is considered inadequate (p. 151), in 
other instances conclusions are derived without qualifications from 5 
minute tests. (7) The fact that the eye adapts to clear seeing at various 
levels of illumination intensity is ignored. 

After citing their experiments on typography and lighting, specifications
for optimum readability are given: (1) 12 point type or larger, (2)
100 foot-candles of light intensity, (3) maximal brightness contrast between ink
and paper, (4) non-glossy paper and ink, (5) a type face of moderate 
boldness, (6) clear-cut delineation of characters in duplicated materials, 
(7) brightness of visual surroundings not less than one-third that of 
reading material, (8) avoidance of preventable glare, (9) adequate cor- 
rection of visual defects, (10) 3 point leading for 10 and 12 point type 
in 21 pica line lengths (widths), (11) 13 pica line length (width) for 10 
point type with 2 point leading, (12) ample margins and (13) a wider 
space between columns than ordinarily used. But this book is printed in 
11 point textype, 23 pica line width with 2 point leading on moderately 
glossy paper. 

Little detailed criticism of specifications is necessary. Items 3, 4, 6, 
7, 8, and 9 above appear adequate in terms of research findings. All 
other specifications, however, may be considered highly questionable 
because they are either formulated without evidence, derived by faulty 
techniques or based upon inadequate evidence. 

Much material in this treatise represents a real contribution to the 
hygiene of vision: (1) certain visibility measurements, (2) study of dupli- 
cate materials, (3) evidence on effects of brightness contrast, and (4) 
discussion of visual deficiencies. Many of the authors’ techniques of 
measurement, however, need additional evaluation by research workers 
in other laboratories. Furthermore no treatise can be adequate that 
ignores certain worthwhile research contributions in the field. Perhaps 
the greatest deficiency of the book is in the misinterpretations, many of 
which are due to inadequate appreciation of certain fundamental princi- 
ples of reading. 

Miles A. Tinker


University of Minnesota 








New Books, Monographs, and Pamphlets 


Books, monographs, and pamphlets for review should be sent to Donald 
G. Paterson, Editor, Department of Psychology, University of 
Minnesota, Minneapolis, Minnesota. 

Survey of Objective Studies of Psychoanalytic Concepts. Robert R. Sears. 
Social Science Research Council, 230 Park Avenue, New York City, 
1943. SSRC Bulletin 51. $1.25. 

Religion and Health. Seward Hiltner. The Macmillan Company, New 
York City, 1943. $2.50. 

Mental Health in College. Clements C. Fry and Edna G. Rostow. The 
Commonwealth Fund, New York, 1943. $2.00. 365 pp. 

The Use of Test Results in Diagnosis and Instruction in the Tool Subjects. 
Arthur E. Traxler. Educational Records Bureau, New York, 1942.
Educational Records Bulletin; No. 18 (Revised). 80 pp. 

Australian Council for Educational Research. Twelfth Annual Report. 
Melbourne, C.1, Australia. 1942. 44 pp. 

The Wartime Adjustment of the Exceptional Child. Proceedings of the 
Eighth Institute on Education of the Exceptional Child. The Woods 
School, Langhorne, Pennsylvania, 1942. 50 pp. 

Check List for Reviewing a Reading Curriculum. Laura K. Eads. Divi- 
sion of Curriculum Research Bulletin No. 1. Bureau of Education, 
New York, New York, 1942. 22 pp. 

Thirty-seventh Annual Report. The Carnegie Foundation for the Ad- 
vancement of Teaching, New York, New York, 1943. Gratis. 
180 pp. 

Reading Readiness in the First Grade. Harriet B. Morrison. Educational 
Research Bulletin No. 5, Bureau of Reference, Research and Statistics, 
Board of Education, New York, New York, 1942. 44 pp. 

Catalog of the Army Institute. Special Service Division, Services of 
Supply, War Department, Washington, D. C., 1942. 44 pp. 




























